Estimating Failure Probabilities
Abstract



In risk management, one often has to estimate the probability that a random vector falls
into an extreme failure set. In the framework of bivariate extreme value theory, we construct
an estimator for such failure probabilities and analyze its asymptotic properties under natural
conditions. It turns out that the estimation error is mainly determined by the accuracy
of the statistical analysis of the marginal distributions. Moreover, we establish confidence
intervals and briefly discuss generalizations to higher dimensions as well as issues arising in
practical applications.





1 Introduction 

Suppose an insurance company has contracts in two related lines of business with all customers
of an insurance portfolio (e.g., fire insurance and business interruption insurance for industrial
customers). On top of quota reinsurances for both lines of business (possibly with different quotas),
the remaining total loss from each incident is covered by an excess of loss reinsurance (CAT-XL)
that pays for the part of the total loss which exceeds a given high retention level $R$. If $X$ and
$Y$ denote the original losses from a fire in both lines of business and $1-a_X$ and $1-a_Y$ the
corresponding quotas, then a claim occurs in the XL-reinsurance if $a_XX+a_YY$ exceeds $R$. For
the purpose of risk management, the reinsurer might be interested in the probability that the
insurance company will file a claim in case of a fire. If the retention level is high, then such a
claim probability cannot be estimated using simple empirical estimates, because in the past the
retention has rarely (or never) been exceeded.

In this paper a more general setting is considered. We are interested in estimating the probability
that a pair of random variables $(X,Y)$ will take on a value in some given "extreme" set. Similar
problems arise naturally in many fields. For example, a coastal dike may fail if the vector built
from the still water level and the wave heights lies in a certain failure set $D$ (cf. Coles and Tawn,
1994, Bruun and Tawn, 1998, and de Haan and de Ronde, 1998). A financial option (like a
down-and-out put) may become worthless if the price vector of the underlyings enters such a "failure set".
Finally, (part of) the principal of a catastrophe bond is lost for the investors if a vector of triggers
becomes too extreme.

As too few observations are available in the extreme failure set $D$ to use standard
statistical methods, extreme value theory is needed to estimate the failure probability
$P\{(X,Y)\in D\}$. The basic idea of multivariate extreme value theory is to assume that the suitably
standardized componentwise maxima of the observed random variables converge to a non-degenerate limit
distribution. It can be shown that this assumption is equivalent to the convergence of suitably
standardized quantile functions of both marginal distributions and a condition on the dependence
structure in extreme regions.

To be more precise, denote the marginal distribution functions of $X$ and $Y$ by $F_1$ and $F_2$,
respectively, and let $U_i(t) := F_i^\leftarrow(1-1/t)$ with $H^\leftarrow(s) := \inf\{x\in\mathbb{R}\mid H(x)\ge s\}$ denoting the
generalized inverse of an increasing function $H$. We assume that there exist real constants $\gamma_i$,
positive functions $a_i$ and real functions $b_i$ such that for $x>0$ and $i\in\{1,2\}$

$$\lim_{t\to\infty}\frac{U_i(tx)-b_i(t)}{a_i(t)}=\frac{x^{\gamma_i}-1}{\gamma_i}. \tag{1.1}$$

For $\gamma_i=0$ read the right-hand side as $\log x$. Note that the right-hand side is the $U$-function
of the generalized Pareto distribution (GPD) with distribution function $1-(1+\gamma_ix)^{-1/\gamma_i}$ for
$1+\gamma_ix>0$, which is to be interpreted as the standard exponential distribution function for $\gamma_i=0$.
The parameter $\gamma_i$ is the so-called extreme value index of the $i$th marginal. If it is positive, then
the support of $F_i$ is unbounded from above and $1-F_i(t)$ roughly decays like the power function
with exponent $-1/\gamma_i$, while for $\gamma_i<0$ the right endpoint $x_i^* := F_i^\leftarrow(1)$ of the support is finite and
$1-F_i(x)$ roughly behaves like a multiple of $(x_i^*-x)^{-1/\gamma_i}$ as $x\uparrow x_i^*$.
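
The relation between the limit in (1.1) and the GPD can be checked numerically. The following is a minimal sketch, not part of the original analysis; the function names `gpd_u` and `gpd_cdf` are hypothetical. It evaluates the limit function $(x^{\gamma}-1)/\gamma$ (read as $\log x$ for $\gamma=0$) and the GPD distribution function, so that one can verify that the GPD distribution function evaluated at $(t^{\gamma}-1)/\gamma$ equals $1-1/t$.

```python
import math


def gpd_u(x: float, gamma: float) -> float:
    """Limit function in (1.1): (x**gamma - 1) / gamma, read as log(x) for gamma == 0."""
    if x <= 0.0:
        raise ValueError("x must be positive")
    if gamma == 0.0:
        return math.log(x)
    # expm1 keeps the quotient accurate when gamma is close to 0
    return math.expm1(gamma * math.log(x)) / gamma


def gpd_cdf(y: float, gamma: float) -> float:
    """GPD distribution function 1 - (1 + gamma*y)**(-1/gamma) for y > 0."""
    if y <= 0.0:
        return 0.0
    if gamma == 0.0:
        return 1.0 - math.exp(-y)  # standard exponential case
    z = 1.0 + gamma * y
    if z <= 0.0:
        return 1.0  # y lies beyond the finite right endpoint (gamma < 0)
    return 1.0 - z ** (-1.0 / gamma)
```

For instance, `gpd_cdf(gpd_u(4.0, g), g)` should be close to `0.75` for positive, zero and negative values of `g`.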

The aforementioned extremal dependence condition can be given in terms of the standardized
random variables $1-F_1(X)$ and $1-F_2(Y)$, which are uniformly distributed on $[0,1]$ if the marginal
distributions are continuous. More precisely, we assume the existence of a measure $\nu$ such that for
$\nu$-continuous Borel sets $B\subset[0,\infty)^2$ bounded away from the origin

$$\lim_{t\to\infty}tP\{(X,Y)\in U(tB)\}=\nu(B). \tag{1.2}$$

Here and in what follows, for functions $h_1,h_2$ which are defined on subsets of the reals, we define
a function $h$ on a subset of $\mathbb{R}^2$ by $h(x_1,x_2):=(h_1(x_1),h_2(x_2))$. The so-called exponent measure
$\nu$ describes the asymptotic dependence structure between extreme observations $X$ and $Y$. Its
homogeneity property

$$\nu(tB)=t^{-1}\nu(B), \tag{1.3}$$

which holds for all Borel sets $B\subset[0,\infty)^2$ and all $t>0$, will be pivotal for the construction of
our estimator of the failure probability. (Seen from a different angle, we assume an approximate
scaling law for the joint distribution of $U^\leftarrow(X,Y)$; cf. Anderson (1994).) Further details about the
extreme value assumptions can be found in de Haan and Ferreira (2006), Sections 1.2 and 6.1, or
Beirlant et al. (2004), Chapters 2 and 8.

We are interested in the situation that at most a few observations lie in the extreme failure set $D$,
which implies that in our mathematical framework the failure set $D=D_n$ must depend on the
sample size $n$ such that the failure probability

$$p_n:=P\{(X,Y)\in D_n\}$$

tends to 0. To motivate an estimator of $p_n$ based on independent copies $(X_i,Y_i)$, $1\le i\le n$, of
$(X,Y)$, first note that from (1.2) we obtain the approximation

$$\frac nkP\Big\{\frac knU^\leftarrow(X,Y)\in B\Big\}\approx\nu(B) \tag{1.4}$$

for any sequence $k=k_n\to\infty$ such that $k/n\to0$. To estimate $p_n$ using this approximation, we
must replace $U^\leftarrow$ and $\nu$ with suitable estimators.

According to (1.1), we may approximate $U_i((n/k)x)$ for sufficiently large $n$ by

$$T_{n,i}(x):=a_i(n/k)\frac{x^{\gamma_i}-1}{\gamma_i}+b_i(n/k) \tag{1.5}$$

and estimate it by

$$\hat T_{n,i}(x):=\hat a_i(n/k)\frac{x^{\hat\gamma_i}-1}{\hat\gamma_i}+\hat b_i(n/k), \tag{1.6}$$

where $\hat a_i(n/k)$, $\hat b_i(n/k)$ and $\hat\gamma_i$ are suitable estimators for $a_i(n/k)$, $b_i(n/k)$ and $\gamma_i$, respectively.
(Later on, the joint asymptotic normality of these estimators is needed; see e.g. de Haan and
Ferreira, 2006, Chapters 3 and 4, or Beirlant et al., 2004, Section 5.6, for examples.) Likewise, the
generalized inverse functions $(k/n)U_i^\leftarrow(x)$ can be estimated by

$$\hat T_{n,i}^\leftarrow(x):=\Big(1+\hat\gamma_i\frac{x-\hat b_i(n/k)}{\hat a_i(n/k)}\Big)^{1/\hat\gamma_i}. \tag{1.7}$$

Here and in the sequel, $(1+\gamma y)^{1/\gamma}$ is defined as $e^y$ if $\gamma=0$. For $1+\gamma y<0$ (or $1+\gamma y=0$ and
$\gamma<0$) the term $(1+\gamma y)^{1/\gamma}$ is not well defined. If $\gamma$ is positive and $y<-1/\gamma$, then it may be
interpreted as 0, while for $\gamma<0$ and $y>-1/\gamma$ it may be defined to be $\infty$. However, we will
see that the precise definition of $(1+\gamma y)^{1/\gamma}$ for very small and for negative values of $1+\gamma y$ is
not important in the present setting (provided it is taken to be a non-decreasing function of $y$),
because the sets on which $\hat T_{n,i}^\leftarrow$, $i\in\{1,2\}$, are not well defined are asymptotically negligible.
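
The conventions for the inverse transformation can be made concrete in a few lines of Python. This is only an illustrative sketch under the conventions stated above; the function names `t_transform` and `t_inverse` and the argument order `(a, b, gamma)` are assumptions made for this example, and the parameter values would in practice come from marginal estimators that are not implemented here.

```python
import math


def t_transform(x: float, a: float, b: float, gamma: float) -> float:
    """a * (x**gamma - 1) / gamma + b for x > 0, with log(x) replacing the quotient if gamma == 0."""
    core = math.log(x) if gamma == 0.0 else math.expm1(gamma * math.log(x)) / gamma
    return a * core + b


def t_inverse(y: float, a: float, b: float, gamma: float) -> float:
    """(1 + gamma * (y - b) / a)**(1/gamma), extended to a non-decreasing function of y."""
    s = (y - b) / a
    if gamma == 0.0:
        return math.exp(s)  # the case gamma = 0 is read as e^s
    z = 1.0 + gamma * s
    if z <= 0.0:
        # below the fitted left endpoint (gamma > 0): interpret as 0;
        # at or beyond the fitted right endpoint (gamma < 0): interpret as infinity
        return 0.0 if gamma > 0.0 else math.inf
    return z ** (1.0 / gamma)
```

With these conventions `t_inverse` is non-decreasing in `y`, which is the only property of the extension that the text requires.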

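If, in (1.4), we substitute $\hat T_n^\leftarrow(x_1,x_2):=(\hat T_{n,1}^\leftarrow(x_1),\hat T_{n,2}^\leftarrow(x_2))$ for the marginal transformation
$(k/n)U^\leftarrow$ and replace the probability on the left-hand side of (1.4) by its empirical counterpart,
we arrive at the following estimator of $\nu$:

$$\hat\nu_n:=\frac1k\sum_{i=1}^n\varepsilon_{\hat T_n^\leftarrow(X_i,Y_i)} \tag{1.8}$$

with $\varepsilon_x$ denoting the Dirac measure with mass 1 at $x$.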

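Now, again interpreting convergence (1.2) (for $t=e_n$) as an approximation, we may estimate the
failure probability as follows:

$$\begin{aligned}
p_n&=P\{(X,Y)\in D_n\}\\
&=P\{(X,Y)\in U(e_n\cdot e_n^{-1}U^\leftarrow(D_n))\}\\
&\approx\frac1{e_n}\nu\big(e_n^{-1}U^\leftarrow(D_n)\big) \qquad (1.9)\\
&\approx\frac1{e_n}\hat\nu_n\Big(\frac n{ke_n}\hat T_n^\leftarrow(D_n)\Big)\\
&=:\hat p_n. \qquad (1.10)
\end{aligned}$$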



The basic idea of this estimator is to blow up the failure set, after a standardization of the marginals,
such that it contains sufficiently many observations to allow the estimation of its probability by
an empirical probability. Here the constants $k$ and $e_n$, which control by how much the transformed
failure set is inflated, can be chosen by the statistician. This must be done appropriately to balance
two contrary effects. On the one hand, $ke_n$ must not be too small, so that the inflated standardized
failure set $n/(ke_n)\hat T_n^\leftarrow(D_n)$ contains sufficiently many marginally transformed observations
$\hat T_n^\leftarrow(X_i,Y_i)$, and thus the empirical probability (1.8) is an accurate estimate of its expectation. On
the other hand, the set $e_n^{-1}U^\leftarrow(D_n)$ must be sufficiently extreme to justify approximation (1.9).
In Section 3 we discuss a heuristic tool to ensure this balance.
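
The construction of the estimator can be sketched in code as follows. This is a hedged illustration rather than the implementation used for the analysis in this paper: the function names, the tuple layout `(a, b, gamma)` for the fitted marginal parameters and the callable `in_failure_set` are assumptions made for the example, and the marginal fits are taken as given. The sketch uses the fact that a transformed observation $z$ lies in $\frac{n}{ke_n}\hat T_n^\leftarrow(D_n)$ if and only if $\hat T_n(c_nz)$ lies in $D_n$, with $c_n=ke_n/n$, which holds where the fitted transformations are one-to-one.

```python
import math


def t_hat(z, a, b, gamma):
    """Fitted transformation a*(z**gamma - 1)/gamma + b, extended to z = 0 and z = inf."""
    if z == 0.0:
        return b - a / gamma if gamma > 0 else -math.inf
    if math.isinf(z):
        return b - a / gamma if gamma < 0 else math.inf
    core = math.log(z) if gamma == 0.0 else math.expm1(gamma * math.log(z)) / gamma
    return a * core + b


def t_hat_inv(y, a, b, gamma):
    """Fitted inverse (1 + gamma*(y - b)/a)**(1/gamma), non-decreasing in y."""
    s = (y - b) / a
    if gamma == 0.0:
        return math.exp(s) if s < 700.0 else math.inf  # avoid float overflow
    w = 1.0 + gamma * s
    if w <= 0.0:
        return 0.0 if gamma > 0 else math.inf
    return w ** (1.0 / gamma)


def estimate_failure_probability(sample, in_failure_set, fit_x, fit_y, k, e_n):
    """Return (1/e_n) * (1/k) * #{i : transformed observation i lies in the inflated set}."""
    n = len(sample)
    c_n = k * e_n / n
    hits = 0
    for x, y in sample:
        z1 = t_hat_inv(x, *fit_x)
        z2 = t_hat_inv(y, *fit_y)
        # membership of (z1, z2) in (1/c_n) * T_hat_inv(D_n), checked on the original scale
        if in_failure_set(t_hat(c_n * z1, *fit_x), t_hat(c_n * z2, *fit_y)):
            hits += 1
    return hits / (k * e_n)
```

For the reinsurance example, `in_failure_set` could be `lambda x, y: a_x * x + a_y * y > R` with hypothetical values for the quotas and the retention level.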

The main goal of the present paper is to establish the asymptotic normality of the estimator $\hat p_n$
under conditions on the underlying distribution and the failure set which are easy to interpret and
relatively simple to verify. In what follows we outline how to achieve this objective.

Recall that, in our asymptotic framework, the failure set $D_n$ must become more extreme in the
sense that it moves in the north-east direction as the sample size $n$ increases to ensure that it
contains at most a few observations. To make both coordinates comparable, we standardize the
marginals using $U^\leftarrow$ and assume that $U^\leftarrow(D_n)$ is essentially an increasing multiple of a fixed set $S$.
That way we ensure that none of the coordinates dominates the other. More precisely, we assume
that for different sample sizes the failure sets are obtained from one fixed set $S\subset[0,\infty)^2$ in that
there exist constants $d_n>0$ tending to $\infty$ such that

$$D_n=U(d_nS)\cap\mathbb{R}^2=\{(U_1(d_nx),U_2(d_ny))\mid(x,y)\in S\}\cap\mathbb{R}^2. \tag{1.11}$$

Note that from the analog to (1.9) where $e_n$ is replaced with $d_n$ one obtains $d_n\approx\nu(S)/p_n$ (see
Lemma 4.9 for a precise proof of the assertion $p_nd_n\to\nu(S)$). Hence the model constants $d_n$
determine at which rate the failure probabilities tend to 0.

The crucial idea in the analysis of the asymptotic behavior of $\hat p_n$ is to approximate the estimator
by the empirical measure of a random transformation $H_n(S)$ of the set $S$ (with $H_n$ defined in
(1.12) below) under the following analog to $\hat\nu_n$ (defined in (1.8)) with the fitted GPDs replaced by
the "true" ones:

$$\nu_n:=\frac1k\sum_{i=1}^n\varepsilon_{T_n^\leftarrow(X_i,Y_i)}.$$

Since the GPD approximation of the marginals is accurate only in the upper tail (and to avoid
the aforementioned problem with the definition of $\hat T_n^\leftarrow$), we must first show that asymptotically it
does not matter if we replace $S$ with a suitably defined subset $S_n^*$ that is bounded away from the
coordinate axes. For this set, we may use the approximation

$$\hat p_n\approx\frac1{e_n}\nu_n\Big(\frac{d_n}{e_n}H_n(S_n^*)\Big),$$

where the random transformation $H_n$ of the marginals is defined by

$$H_n(x):=\frac{e_n}{d_n}T_n^\leftarrow\circ\hat T_n\circ(\hat T_n^{(c)})^\leftarrow\circ U(d_nx) \tag{1.12}$$

with

$$c=c_n:=\frac kne_n \tag{1.13}$$

and

$$\hat T_n^{(c)}(x,y)=\hat T_n(c_nx,c_ny). \tag{1.14}$$

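Check that by (1.1) one has $H_n(x)\approx(e_n/d_n)(\hat T_n^{(c)})^\leftarrow\circ U(d_nx)\approx(e_n/d_n)(\hat T_n^{(c)})^\leftarrow\circ T_n((k/n)d_nx)\approx x$
(cf. Lemma 4.1). Using the homogeneity of $\nu$, we will thus break the estimation error into several
parts as follows:

$$\begin{aligned}
\hat p_n-p_n={}&\hat p_n-\frac1{e_n}\nu_n\Big(\frac{d_n}{e_n}H_n(S_n^*)\Big)\\
&+\frac1{e_n}\big(\nu_n(B)-E\nu_n(B)\big)\big|_{B=(d_n/e_n)H_n(S_n^*)}\\
&+\frac1{e_n}\big(E\nu_n(B)-\nu(B)\big)\big|_{B=(d_n/e_n)H_n(S_n^*)}\\
&+\frac1{d_n}\big(\nu(H_n(S_n^*))-\nu(S_n^*)\big)\\
&+\frac1{d_n}\big(\nu(S_n^*)-\nu(S)\big)\\
&+\nu(d_nS)-p_n\\
=:{}&I+II+III+IV+V+VI. \qquad (1.15)
\end{aligned}$$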

It will turn out that, under suitable conditions, part IV dominates all the other terms. Its
asymptotic behavior is largely determined by the asymptotics of the marginal estimators if $\nu$ is
sufficiently smooth.

Under very weak conditions on the set $S$, we will show that the terms I and V are negligible if
$S_n^*$ is defined suitably. If $d_n/e_n$ is bounded and bounded away from 0, then using methods from
empirical process theory the second term can be shown to be asymptotically negligible. Part VI
is a bias term which is negligible if $d_n$ is sufficiently large (depending on the rate of convergence in
(1.2)). Similarly, the term III, which equals $((n/k)P\{T_n^\leftarrow(X,Y)\in B\}-\nu(B))/e_n$ for $B=(d_n/e_n)H_n(S_n^*)$,
describes a bias term which is asymptotically negligible if both the approximation (1.4) and the
marginal approximation $U((n/k)B)\approx T_n(B)$ are sufficiently accurate.

An estimator related to $\hat p_n$ has been suggested and analyzed by de Haan and Sinha (1999) in a
much more restrictive framework. In particular, it has been assumed that the same number $k_n$ of
largest order statistics is used for both marginal fits, which is inefficient if the GPD approximation
(cf. (2.1) below) is less accurate for one of the marginal distributions. Likewise, the flexibility of
the estimator is increased in the present paper by allowing the blow-up factor $e_n$ to deviate
from the unknown model constant $d_n$, while de Haan and Sinha (1999) used a consistent estimator
of $d_n$, which was made identifiable by fixing some point on the boundary of $S$.

Moreover, the shape of the failure set considered by de Haan and Sinha is restricted. E.g., the case
$q(\infty)=0$ (in our notation; cf. condition (Q2) below) is ruled out by condition (2.9) of that paper.
The model assumption

$$D_n:=\{(s,t)\mid f(s/x_n,t/y_n)>1\}$$

for some function $f$ and sequences of normalizing constants $x_n$ and $y_n$ seems quite restrictive
and unnatural, because it allows the failure set to tend towards the "north-east" only by a linear
scaling of both marginals. This parametrization does not fit well to extreme value theory if the
extreme value indices are not positive, which is usually the case in environmetrics, arguably the
most important field of application of our theory. Even more troublesome is the fact that by
assumption (1.5) of de Haan and Sinha (1999) the failure set is described in terms of the number
$k_n$ of largest order statistics that is picked by the statistician. Hence the model parametrization
depends on the statistical procedure used to analyze the model, which makes it extremely difficult
to interpret.

Finally, while the influence of each marginal transformation is clearly separated in the description
of the limiting distribution in our main Theorem 2.1, in Theorem 4.1 of de Haan and Sinha (1999)
the marginal parameters are seemingly intermingled. Therefore, the generalization of the present
results to higher dimensions is much more straightforward than that of the results of de Haan and
Sinha (see the discussion at the end of Section 2).

An alternative to our genuinely multivariate estimator can be constructed by the so-called
structural variable approach if the failure set is of the form $D_n=\{(s,t)\mid h(s,t)>t_n\}$ for some known
function $h$ and threshold $t_n$. Then one may apply techniques from univariate extreme value theory
to the pseudo-observations $h(X_i,Y_i)$, $1\le i\le n$ (cf. Coles (2001), §8.2.4 and page 156, or Bruun
and Tawn (1998)). However, even for this class of failure sets, an analysis of the dependence
structure between the two components of the observed vectors is of independent interest, and it
seems more natural to use the same approach for model fitting and for the estimation of quantities
like failure probabilities. Moreover, often one wants to estimate the failure probability for several
different sets (e.g., to find the cheapest construction to ensure a certain level of safety); in this case
it is both more efficient and more natural to use estimators in a unified framework as considered
in the present paper.

In the multivariate approach, Coles and Tawn (1994) and Bruun and Tawn (1998) used parametric
models for the dependence structure in the closely related problem of estimating a parameter defining
a failure set such that the corresponding failure probability equals a given value. However, usually
there is no physical reason for such parametric models. By using them nevertheless, one trades a
modeling error, which is difficult to assess, for an estimation error, which can be quantified at least
asymptotically (see Theorem 2.1 below). Indeed, Davison (1994) suggested in the discussion of
Coles and Tawn (1994) that for sufficiently large sample sizes a nonparametric approach may be
advisable.

Finally, we would like to mention that the assumptions used in the present paper rule out that the
exponent measure $\nu$ concentrates on the coordinate axes (i.e., here $X$ and $Y$ are assumed
asymptotically dependent in the sense of multivariate extreme value theory). In the case of asymptotically
independent coordinates $X$ and $Y$, consistency of an analogous estimator for the failure probability
was proved by Draisma et al. (2004), while its asymptotic normality was established by Müller
(2008).

The paper is organized as follows: In Section 2 we first introduce and discuss in detail the framework 
in which we then prove asymptotic normality of our estimator of the failure probability. Moreover, 
we propose a consistent estimator of the limiting variance and derive an asymptotic confidence 
interval. In Section 3 we apply the theory to the motivating example given at the beginning. In 
this context, we also discuss the roles of $k$ and $e_n$ and propose a heuristic approach for choosing
those numbers. All proofs are collected in Section 4. 



2 Main results 



We will make the following assumptions about the marginal distributions and the estimators of 
the marginal parameters: 

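(M1) There exist constants $x_i^0<F_i^\leftarrow(1)$ such that $F_i$ is continuous and strictly increasing on
$[x_i^0,F_i^\leftarrow(1)]\cap\mathbb{R}$ for $i\in\{1,2\}$.

(M2) For all $i\in\{1,2\}$, there exist normalizing functions $a_i>0$, $b_i\in\mathbb{R}$ and $A_i\ne0$ and constants
$\rho_i<0$ such that for all $x>0$

$$\lim_{t\to\infty}\frac{\dfrac{U_i(tx)-b_i(t)}{a_i(t)}-\dfrac{x^{\gamma_i}-1}{\gamma_i}}{A_i(t)}=\Psi_{\gamma_i,\rho_i}(x):=\begin{cases}\dfrac{x^{\gamma_i+\rho_i}}{\gamma_i+\rho_i},&\gamma_i+\rho_i\ne0,\\[1ex]\log x,&\gamma_i+\rho_i=0.\end{cases}$$

(M3)

$$k^{1/2}\Big(\frac{\hat a_i(n/k)}{a_i(n/k)}-1,\ \frac{\hat b_i(n/k)-b_i(n/k)}{a_i(n/k)},\ \hat\gamma_i-\gamma_i\Big)_{1\le i\le2}\longrightarrow(\alpha_i,\beta_i,\Gamma_i)_{1\le i\le2}$$

weakly.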

Condition (M1) is not essential, but it is assumed to simplify the proofs and the formulation of
some technical results (cf. de Haan and Ferreira, 2006, Theorem B.3.13).

(M2) is the usual second order condition with the additional restriction that the second order
parameters $\rho_i$ are negative. Again, one may drop the latter assumption at the cost of additional
technical complications. According to Corollary 2.3.7 of de Haan and Ferreira (2006) we may and
will assume that the normalizing functions are chosen such that the following uniform version
holds: For all $\varepsilon,\delta>0$ there exists $t_0$ such that

$$\left|\frac{\dfrac{U_i(tx)-b_i(t)}{a_i(t)}-\dfrac{x^{\gamma_i}-1}{\gamma_i}}{A_i(t)}-\Psi_{\gamma_i,\rho_i}(x)\right|\le\delta x^{\gamma_i+\rho_i}\max(x^\varepsilon,x^{-\varepsilon})=:\delta x^{\gamma_i+\rho_i\pm\varepsilon} \tag{2.1}$$

provided $t,tx\ge t_0$. In fact, the main results hold under the following weaker assumption:

$$\frac{U_i(tx)-b_i(t)}{a_i(t)}-\frac{x^{\gamma_i}-1}{\gamma_i}=O\big(A_i(t)x^{\gamma_i+\rho_i\pm\varepsilon}\big) \tag{2.2}$$

as $t\to\infty$ uniformly for $x\ge t_0/t$. Note that, under condition (M2), $A_i$ is regularly varying with
index $\rho_i$.

Condition (M3) gives a lower bound on the rate at which the marginal estimators converge. Notice
that some of the limiting random variables may be equal to 0 almost surely. In particular, this
will usually be the case if the $i$th marginal estimators use the $k_i$ largest order statistics and $k=o(k_i)$.
However, typically at least some of the limiting random variables are non-degenerate and jointly
normally distributed. In the sequel, we will choose versions such that the convergence in (M3) holds
in probability.

The failure set $D_n$ has to satisfy the following conditions.

(Q1) There exist a set

$$S=\{(x,y)\in[0,\infty)^2\mid y\ge q(x)\}\subset[0,\infty)^2$$

and constants $d_n>0$ tending to $\infty$ such that

$$D_n=U(d_nS)\cap\mathbb{R}^2=\{(U_1(d_nx),U_2(d_ny))\mid(x,y)\in S\}\cap\mathbb{R}^2.$$

Here $q:[0,\infty)\to[0,\infty]$ is some function that is monotonically decreasing and continuous from
the right with $q(0)>0$.

(Q2)

$$\begin{aligned}x^{1-\gamma_1}/|\log x|&=O(q(x))&&\text{as }x\downarrow x_l:=\inf\{x\ge0\mid q(x)<\infty\},\\ y^{1-\gamma_2}/|\log y|&=O(q^\leftarrow(y))&&\text{as }y\downarrow q(\infty):=\lim_{x\to\infty}q(x).\end{aligned}$$

In particular, condition (Q1) ensures that one may define the generalized (right-continuous) inverse
function (of a decreasing function) in the usual way:

$$q^\leftarrow(v):=\inf\{x\ge0\mid q(x)\le v\}$$

with the convention $\inf\emptyset=\infty$. The conditions (Q2) are always fulfilled if $\gamma_1<1$ or $x_l>0$, resp.,
if $\gamma_2<1$ or $q(\infty)>0$.
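
For a concrete boundary function the generalized inverse can be evaluated numerically. The following bisection routine is a minimal sketch, assuming that the decreasing, right-continuous inverse is read as the infimum of all $x\ge0$ at which the boundary function is at most $v$; the function name `q_inverse` and the cap `x_max` (beyond which the infimum is reported as infinite) are hypothetical choices for this illustration.

```python
import math


def q_inverse(q, v, x_max=1e12, tol=1e-12):
    """Approximate inf{x >= 0 : q(x) <= v} for a decreasing, right-continuous q."""
    if q(0.0) <= v:
        return 0.0
    if q(x_max) > v:
        return math.inf  # convention: the infimum of the empty set is infinity
    lo, hi = 0.0, x_max  # invariant: q(lo) > v >= q(hi)
    while hi - lo > tol * max(1.0, hi):
        mid = 0.5 * (lo + hi)
        if q(mid) <= v:
            hi = mid
        else:
            lo = mid
    return hi
```

For example, with the hypothetical boundary `q = lambda x: math.inf if x == 0 else 1.0 / x`, the call `q_inverse(q, 0.5)` should return a value close to `2.0`.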

Moreover, we need some conditions on the extremal dependence between $X$ and $Y$. Recall that
asymptotically the extremal dependence is described by the exponent measure $\nu$ defined in (1.2).
In view of (1.1), one may replace the standardization by $U$ with a standardization using $T_n$. To
bound the bias terms III and VI in (1.15), we must specify the rate of the resulting convergence
towards $\nu$.

(D1) There exist an exponent measure $\nu$ on $[0,\infty)^2$ and a function $A_0(t)>0$ converging to 0 as
$t$ tends to $\infty$ such that



n 



uniformly for all sets $B\in\mathcal{B}_{t_n,M}$ for $t_n=n/k$ and for $t_n=d_n$ and arbitrary $M>0$.

Here, Bt„^M consists of all sets of the form {{H^'^, c.{xi))ie{i,2} I (3;i,X2) € C} with C = S 
[u,oo) X [u,oo) or C = [xi,u) x [q{u—), 00) or C = [q'^{v),oo) x [q(oo),v) for some u,v > 
and some ^i,Xi:d ^ [—M,M] if t„ = n/k, and Bt„^M comprises all sets of the form {((1 + 
li{Ui{dnXi) - bi{dn)) / ai{dn)Y/'^')ie{i,2} I (a;i,2;2) € C] with C = [xi,u) x [q{u-), 00) or C = 
[g"^(f),oo) X [q{oo),v) for some n, u > if t„ = dn- Here, for i € {1,2}, 



ff^"''^ Cxi 






1/7. 



(2.3) 
(D2) The exponent measure has a strictly positive, continuous Lebesgue density $\eta$.

Alternatively, the dependence may be described by the pertaining spectral measure $\Phi$ on $[0,\pi/2]$
defined by

$$\Phi([0,\vartheta])=\nu\{(x,y)\in[0,\infty)^2\mid x^2+y^2>1,\ \arctan(y/x)\le\vartheta\},\quad\vartheta\in[0,\pi/2].$$

Condition (D2) is equivalent to the assumption that $\Phi$ has a continuous Lebesgue density $\varphi$
that is strictly positive on $[0,\pi/2]$. In particular, this assumption rules out that $X$ and $Y$ are
asymptotically independent (in the sense of multivariate extreme value theory), because then the
spectral measure is concentrated on $\{0,\pi/2\}$. The relationship between $\eta$ and $\varphi$ is given by

$$\eta(x,y)=(x^2+y^2)^{-3/2}\varphi\big(\arctan(y/x)\big),\quad x,y>0. \tag{2.4}$$
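
The relationship (2.4) between the density of the exponent measure and the spectral density can be illustrated numerically. The sketch below assumes that (2.4) reads $\eta(x,y)=(x^2+y^2)^{-3/2}\varphi(\arctan(y/x))$, i.e. that the angle is measured from the first coordinate axis; the function name `eta_from_spectral_density` and the example spectral densities are hypothetical. Under this reading, $\eta$ is homogeneous of order $-3$, which matches the homogeneity (1.3) of the exponent measure.

```python
import math


def eta_from_spectral_density(phi, x, y):
    """Density of the exponent measure at (x, y), x, y > 0, from a spectral density phi."""
    r_squared = x * x + y * y
    # for x, y > 0, atan2(y, x) coincides with arctan(y / x)
    return r_squared ** -1.5 * phi(math.atan2(y, x))
```

For instance, with `phi = lambda t: 1.0 + math.sin(2.0 * t)` one can check that `eta_from_spectral_density(phi, 2 * x, 2 * y)` agrees with `eta_from_spectral_density(phi, x, y) / 8` up to rounding.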

In (D1) the rectangles can also be replaced with the subsets $S\cap((0,u)\times(0,\infty))$, resp. $S\cap((0,\infty)\times(0,v))$.
It is easy to see that condition (D1) is met if $(X,Y)$ has a density $f$ which satisfies the
following approximation

$$\sup_{(x,y)\in(0,\infty)^2,\,x\vee y>1}\frac1{w(x,y)}\left|ta_1(t)a_2(t)x^{\gamma_1-1}y^{\gamma_2-1}f\Big(a_1(t)\frac{x^{\gamma_1}-1}{\gamma_1}+b_1(t),\,a_2(t)\frac{y^{\gamma_2}-1}{\gamma_2}+b_2(t)\Big)-\eta(x,y)\right|=O(A_0(t))$$

for some weight function $w$ which is Lebesgue-integrable on $\{(x,y)\in(0,\infty)^2\mid x\vee y>1\}$. This
sufficient condition applies, e.g., to the bivariate Cauchy distribution restricted to $(0,\infty)^2$ and to
densities of the form $f(x,y)=1/(1+x^\alpha+y^\beta)$ with $\alpha,\beta>1$ such that $\beta>\alpha/(\alpha-1)$.
Finally, we impose the following conditions on the sequences $d_n$, $e_n$ and $k=k_n$.

(S1) $k\to\infty$, $n=O(e_n)$ (so that $k=O(c_n)$ with $c_n=e_nk/n\to\infty$), $d_n\asymp e_n$ (i.e., $0<\liminf_{n\to\infty}d_n/e_n\le\limsup_{n\to\infty}d_n/e_n<\infty$), and $w_n(\gamma_i)=o(k^{1/2})$ for $i\in\{1,2\}$ with

$$w_n(\gamma_i):=\begin{cases}\log c_n,&\gamma_i>0,\\ \tfrac12\log^2c_n,&\gamma_i=0,\\ (d_nk/n)^{-\gamma_i},&\gamma_i<0.\end{cases}$$

(S2) $A_i(n/k)=o(k^{-1/2}w_n(\gamma_i))$ for $i\in\{1,2\}$ and $A_0(n/k)=o\big(k^{-1/2}\max(w_n(\gamma_1),w_n(\gamma_2))\big)$.

(53) A;i/2 = 0{cn V cl') if 7j > for i G {1, 2}, 
A; V2 = o{ci~"'^ ) if 71 < and xi = 0, and 
A;V2 = o(ci~^2) if 72 < and g(oo) = 0. 

Recall that $d_n$ is a constant determined by the model, which describes the rate at which the
probability $p_n$ to be estimated tends to 0, while $e_n$ is chosen by the statistician such that the
inflated failure set contains sufficiently many observations. It seems natural to choose $e_n$ of the
same order as $d_n$, because this way one compensates for the shrinkage of $D_n$. More precisely,
$d_n\asymp e_n$ if and only if the expected number of transformed observations in the inflated transformed
failure set is of the same order as $k$, which can easily be checked in practical applications. To see
this, note that by (1.4), (1.3) and (1.11) this expected number equals

$$nP\Big\{\hat T_n^\leftarrow(X,Y)\in\frac n{ke_n}\hat T_n^\leftarrow(D_n)\Big\}\approx nP\Big\{\frac knU^\leftarrow(X,Y)\in\frac1{e_n}U^\leftarrow(D_n)\Big\}\approx k\nu\Big(\frac{d_n}{e_n}S\Big)=k\frac{e_n}{d_n}\nu(S).$$

We would like to emphasize, though, that this condition can be substantially weakened at the price
that one needs different conditions for different combinations of signs of $\gamma_1$ and $\gamma_2$.
The first condition of (S1) ensures that the expected number of marginally standardized observations
in the inflated standardized failure region tends to $\infty$, whereas the second condition means
that the expected number of observations in the failure region remains bounded as $n\to\infty$. The
last condition of (S1) is needed to ensure consistency of the estimator in the sense that $\hat p_n/p_n\to1$.
Note that it can only be satisfied if $\min(\gamma_1,\gamma_2)>-1/2$. This restriction on the extreme value
indices usually arises if one wants to prove asymptotic normality for estimators of tail probabilities;
cf., e.g., de Haan and Ferreira (2006), Remark 4.4.3, or Drees et al. (2006), Remark 2.2.

From (S2) it follows that the bias is asymptotically negligible, while (S3) will imply that the part
of the set $S$ near the axes (corresponding to observations where one of the coordinates is much
larger than the other) does not play an important role asymptotically. As above, these
conditions may also be substantially weakened at the price of much more complicated conditions
on the behavior of $q$ depending on $\gamma_1$, $\gamma_2$ and $\eta$.

Under these conditions we establish an asymptotic approximation of the estimator $\hat p_n$.

Theorem 2.1. If the conditions (M1)-(M3), (D1), (D2), (Q1), (Q2) and (S1)-(S3) are fulfilled,
then



k^/'^dn{Pn -Pn 



:t - /3i - ^) I,°^o.)(l^(v)y-^^v(q'-(v),v) dv, 71 < 
-^1 I^oc) Q'~('^)v(q^(v),v) dv, 71 = 

12 Jxi 

"2 a ^2 



-§ L7 9(^)^(^' ^N) ^^' 72 > 

^,, (32-^)J^{q{u)y-^^r^{u,q{u))du, 72 < 

"^2 1^ q(u)v(u, q(u)) du, 72 = 



+ Op{wn{^l) V 'u;n(72)) (2-5) 

Since $p_nd_n\to\nu(S)$, Theorem 2.1 remains true when the left-hand side of (2.5) is replaced with
$k^{1/2}\nu(S)(\hat p_n/p_n-1)$.

The weights $w_n(\gamma_1)$ and $w_n(\gamma_2)$ on the right-hand side of (2.5) may be different, and then they
converge to $\infty$ at different rates. More precisely, $w_n(\gamma)$ is a non-increasing function of $\gamma$, and it
is strictly decreasing on $(-\infty,0]$. Therefore, the smaller of both marginal extreme value indices
$\gamma_1$ and $\gamma_2$ determines the rate of convergence of $\hat p_n$ towards $p_n$. If at least one of the indices is
non-positive and the indices are not equal, then the summand corresponding to the larger index is
negligible. (In that case, it may happen that one cannot prove asymptotic normality using Theorem
2.1 because the limiting random variables $\alpha_i$, $\beta_i$ and $\Gamma_i$ pertaining to the smaller extreme value
index are equal to 0; cf. the above discussion of condition (M3).)

If both extreme value indices are positive, then both main terms on the right-hand side of (2.5)
are of the same order. In that case, $(k^{1/2}d_n/\log c_n)(\hat p_n-p_n)$ converges to a limit distribution which
typically will be non-degenerate if at least one of the limiting random variables $\Gamma_1$ and $\Gamma_2$ in (M3)
is non-degenerate. If they are jointly normal, then we may derive the asymptotic normality of the
estimator for the failure probability $p_n$.

Theorem 2.1 can be used to construct asymptotic confidence intervals. To this end, it is advisable
to reformulate the assertion as a convergence result on $k^{1/2}e_n(\hat p_n-p_n)$, because $d_n$ is unknown.
Then one needs consistent estimators for the variance of the random variables occurring on the
right-hand side of (2.5), which usually are asymptotically normal, and consistent estimators for
$e_n/d_n$ times the integral there.

We will outline how to estimate the term $(e_n/d_n)\int_{x_l}^\infty q(u)\eta(u,q(u))\,du$, which is needed in the case
$\gamma_2>0$. To avoid the estimation of the density $\eta$ of $\nu$, we approximate the integral by the $\nu$-measure
of a shrinking set as follows. Because $\eta$ is continuous, for small $\epsilon_n$ one has

$$\frac{e_n}{d_n}\int_{x_l}^\infty q(u)\eta(u,q(u))\,du\approx\frac{e_n}{d_n}\frac1{2\epsilon_n}\int_{x_l}^\infty\int_{(1-\epsilon_n)q(u)}^{(1+\epsilon_n)q(u)}\eta(u,v)\,dv\,du=\frac1{2\epsilon_n}\big(\nu(S_{n,2}^-)-\nu(S_{n,2}^+)\big)$$

with

$$S_{n,2}^\pm:=\Big\{(u,(1\pm\epsilon_n)v)\ \Big|\ (u,v)\in\frac{d_n}{e_n}S\Big\}.$$

Now one can proceed similarly as in (1.9) (using (1.4) and (1.11)) to construct an estimator of
$(e_n/d_n)\int_{x_l}^\infty q(u)\eta(u,q(u))\,du$:

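Corollary 2.2. Let

$$\hat S_{n,2}^\pm:=\Big\{(u,(1\pm\epsilon_n)v)\ \Big|\ (u,v)\in\frac n{ke_n}\hat T_n^\leftarrow(D_n)\Big\}$$

for some sequence $\epsilon_n\downarrow0$ such that $k^{-1/2}(w_n(\gamma_1),w_n(\gamma_2))=o(\epsilon_n)$. Suppose that all conditions
of Theorem 2.1 are fulfilled and, in addition, that an analog to condition (D1) holds where $c_n$ is
replaced with $c_n/(1\pm\epsilon_n)$. Then

$$\hat I_{n,2}:=\frac{\hat\nu_n(\hat S_{n,2}^-)-\hat\nu_n(\hat S_{n,2}^+)}{2\epsilon_n}=\frac{e_n}{d_n}\int_{x_l}^\infty q(u)\eta(u,q(u))\,du\,(1+o_P(1)).$$

In a completely analogous way one can estimate $(e_n/d_n)\int_{q(\infty)}^\infty q^\leftarrow(v)\eta(q^\leftarrow(v),v)\,dv$ by
$\hat I_{n,1}:=(\hat\nu_n(\hat S_{n,1}^-)-\hat\nu_n(\hat S_{n,1}^+))/(2\epsilon_n)$ with

$$\hat S_{n,1}^\pm:=\Big\{((1\pm\epsilon_n)u,v)\ \Big|\ (u,v)\in\frac n{ke_n}\hat T_n^\leftarrow(D_n)\Big\}.$$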
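
The difference quotient of empirical measures used in these estimators amounts to counting transformed observations in two rescaled copies of the inflated set. The following sketch illustrates this under assumptions: the function name `boundary_integral_estimate`, the argument `points` (the marginally transformed observations) and the callable `in_inflated_set` (membership in the inflated standardized failure set) are hypothetical and must be supplied by the user. A point $(z_1,z_2)$ belongs to the copy whose second coordinate is stretched by a factor $c$ if and only if $(z_1,z_2/c)$ belongs to the original set.

```python
def boundary_integral_estimate(points, in_inflated_set, k, eps, coordinate=2):
    """(nu_hat(S^-) - nu_hat(S^+)) / (2*eps), where nu_hat puts mass 1/k on each point.

    S^-/S^+ arise from the inflated set by multiplying the chosen coordinate
    (1 or 2) by (1 - eps) and (1 + eps), respectively.
    """
    def count(factor):
        total = 0
        for z1, z2 in points:
            base = (z1, z2 / factor) if coordinate == 2 else (z1 / factor, z2)
            if in_inflated_set(*base):
                total += 1
        return total

    return (count(1.0 - eps) - count(1.0 + eps)) / (k * 2.0 * eps)
```

Only points close to the boundary of the inflated set contribute to the difference, so `eps` must not be chosen too small relative to the number of available observations.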



Now suppose that both extreme value indices $\gamma_i$ are positive and that we estimate them by the Hill
estimator, i.e., $\hat\gamma_1=k_1^{-1}\sum_{i=1}^{k_1}\log(X_{n-i+1:n}/X_{n-k_1:n})$ with $X_{n-i+1:n}$ denoting the $i$th largest order
statistic among $X_1,\dots,X_n$, and likewise $\hat\gamma_2=k_2^{-1}\sum_{i=1}^{k_2}\log(Y_{n-i+1:n}/Y_{n-k_2:n})$. It is well known
that $k_i^{1/2}(\hat\gamma_i-\gamma_i)\to N(0,\gamma_i^2)$ if condition (M2) holds and $k_i^{1/2}A_i(n/k_i)\to0$. In particular, $\Gamma_i=0$
if $k=o(k_i)$. However, if $k_i/k\to\kappa_i\in(0,\infty)$ for both $i=1$ and $i=2$, then the joint distribution
of $\Gamma_1$ and $\Gamma_2$ is needed for the construction of confidence intervals.
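
The Hill estimator described above takes only a few lines of code. The sketch below is an illustration with a hypothetical function name; it assumes positive data in the upper tail and averages the log-ratios of the `k` largest observations to the `(k+1)`th largest one.

```python
import math


def hill_estimator(data, k):
    """Hill estimator based on the k largest order statistics of data."""
    xs = sorted(data)
    n = len(xs)
    if not 1 <= k < n:
        raise ValueError("k must satisfy 1 <= k < n")
    threshold = xs[n - k - 1]  # order statistic X_{n-k:n}
    if threshold <= 0.0:
        raise ValueError("the threshold order statistic must be positive")
    # xs[n - i] is the i-th largest observation X_{n-i+1:n}
    return sum(math.log(xs[n - i] / threshold) for i in range(1, k + 1)) / k
```

For the toy sample `[1, 2, 4, 8]` and `k = 2` the threshold is `2`, so the estimate is `(log 4 + log 2) / 2`.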

In the case $k_1=k_2=k$, de Haan and Resnick (1993) derived a representation of $\Gamma_i$ in terms of
a Gaussian process under slightly different conditions than used in the present paper. One may
mimic their approach to show that under our conditions, $(\Gamma_i/\gamma_i)_{i\in\{1,2\}}$ has the same distribution as
$\big(\big(\int_1^\infty t^{-1}W_i(t/\kappa_i)\,dt-W_i(1/\kappa_i)\big)/\kappa_i\big)_{i\in\{1,2\}}$, where $(W_1,W_2)$ is a bivariate centered Gaussian process
with covariance function given by $Cov(W_1(s),W_1(t))=\nu((s\vee t,\infty)\times(0,\infty))$, $Cov(W_2(s),W_2(t))=\nu((0,\infty)\times(s\vee t,\infty))$
and $Cov(W_1(s),W_2(t))=\nu((s,\infty)\times(t,\infty))$. Direct calculations show that
thus $(\Gamma_i/\gamma_i)_{i\in\{1,2\}}$ is a centered Gaussian vector with marginal variances $1/\kappa_i$ and covariance
$\nu((\kappa_2,\infty)\times(\kappa_1,\infty))$. Hence, with $z_{1-\alpha/2}$ denoting the standard normal $(1-\alpha/2)$-quantile and

$$\hat\sigma^2:=\hat I_{n,1}^2/\kappa_1+\hat I_{n,2}^2/\kappa_2+2\hat\nu_n((\kappa_2,\infty)\times(\kappa_1,\infty))\hat I_{n,1}\hat I_{n,2},$$

$$\big[\hat p_n-k^{-1/2}e_n^{-1}\log c_n\,\hat\sigma z_{1-\alpha/2},\ \hat p_n+k^{-1/2}e_n^{-1}\log c_n\,\hat\sigma z_{1-\alpha/2}\big] \tag{2.6}$$

is a two-sided confidence interval for $p_n$ with asymptotic confidence level $1-\alpha$. (This formula is
also applicable if one of the $\kappa_i$ equals $\infty$.)
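
Once the ingredients are available, the confidence interval (2.6) is a direct computation. The sketch below assumes the reading of (2.6) as $\hat p_n\pm k^{-1/2}e_n^{-1}\log c_n\,\hat\sigma z_{1-\alpha/2}$ for positive extreme value indices estimated by Hill estimators; the function names and argument names are hypothetical, and all inputs (the point estimate, the two integral estimates and the estimated covariance term) are taken as given.

```python
import math
from statistics import NormalDist


def sigma_hat_squared(i1, i2, kappa1, kappa2, nu_corner):
    """Variance estimate: i1^2/kappa1 + i2^2/kappa2 + 2 * nu_corner * i1 * i2.

    nu_corner stands for the estimated exponent measure of
    (kappa2, inf) x (kappa1, inf).
    """
    return i1 ** 2 / kappa1 + i2 ** 2 / kappa2 + 2.0 * nu_corner * i1 * i2


def failure_probability_ci(p_hat, k, e_n, c_n, sigma_hat, alpha=0.05):
    """Two-sided interval p_hat -/+ k^(-1/2) * log(c_n) * sigma_hat * z / e_n."""
    z = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    half_width = k ** -0.5 / e_n * math.log(c_n) * sigma_hat * z
    return p_hat - half_width, p_hat + half_width
```

Note that the lower bound may be negative for small samples; how to handle this is a practical choice that the asymptotic result itself does not prescribe.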

As an alternative to the above approach, one may estimate the density of the spectral measure $\Phi$
(cf. Cai et al., 2011) and construct both an estimator for the integrals and for the joint distribution
of the limiting random variables on the right-hand side of (2.5) from it.

We conclude this section by indicating how to generalize the main result to $\mathbb{R}^d$-valued vectors
$X_i=(X_{i,1},\dots,X_{i,d})$ of arbitrary dimension $d\ge2$, although a detailed discussion is beyond the scope
of this paper. An inspection of the proof of Lemma 4.3 reveals that the generalized inverse $q^\leftarrow$ of
the function $q$ is used to describe the boundary of the set $S$ as a function of the second coordinate.
If $d>2$ (and hence the generalized inverse is not defined), then an analogous description is needed
for all coordinates, i.e., we need $d$ different representations of the set $S$ of the form

$$S=\{x\in[0,\infty)^d\mid x_i\ge q_i(x_{-i})\},\quad1\le i\le d, \tag{2.7}$$

where $x_{-i}\in[0,\infty)^{d-1}$ denotes the vector $x$ with the $i$th coordinate removed and $q_i$ are suitable
$[0,\infty]$-valued functions that are decreasing in each argument. Then one may proceed as in the case $d=2$
by separately examining the influence of the transformation of each marginal on the $\nu$-measure
of the (suitably restricted) set $S$. Under suitable integrability conditions on the functions $q_i$ and
obvious generalizations of the conditions (M1)-(M3), (D1), (D2) and (S1)-(S3), it can be shown
that



k^/^dniPn -Pn) 




d 
= ^Wn(7i)" 




i=l 


-r 



'^J qi{v)r]{qi{v))hQ,oo){qi{v))X'^-^{dv), 
- /3_Ii) j{q^{v)Y~^-r^{Uv))h,,ooMv)) 
J qi{v)ri{qi{v))l(^0^^){q,{v)) l'^-^{dv), 



,d-l 



(dv) 



(2.8) 
7i>0 

7i < +op{w„{ji)) 
7i = 



Here $\lambda^{d-1}$ denotes the Lebesgue measure on $[0,\infty)^{d-1}$ and $\tilde q_i(v)$ is the vector in $[0,\infty)^d$ whose $i$th
coordinate equals $q_i(v)$ and whose other $d-1$ coordinates are those of $v$.

If the boundary $\partial S$ of the set $S$ is sufficiently smooth, then the integrals on the right-hand side
of (2.8) can be represented more naturally as integrals w.r.t. certain differential forms (see, e.g.,
Schreiber, 1977, for an informal introduction to differential forms). More precisely, assume that
there exists a set $D \subset [0,\infty)^{d-1}$ and a continuously differentiable function $q : D \to [0,\infty)$, such
that $\partial S = \{(u,q(u)) \mid u \in D\}$ and the map $\Psi : D \to [0,\infty)^d$, $\Psi(u) = (u,q(u))$ has a Jacobian
which is invertible everywhere. Then the right-hand side of (2.8) equals



^Wniji) 



.r» 



Jq, PTi ■ rj dxi A • • • A dx-i-i A dxj+i A • • • A dxd, 



(f - /3_^) j^{pnf~^^ ■7]dxiA---A dx^-i A dxi+i A 
— Fj J^ pvi ■ T] dxi A • • • A dxi-i A dxj+i A • • • A dxd, 



7*>0 
■ A dxd, 7* < +op{wn{li)) 
7i = 



with $\mathrm{pr}_i$ denoting the projection to the $i$th coordinate. This representation reflects most clearly
the fact that the $i$th term results from the change of the boundary surface of $S$ by the marginal
transformation $H_{n,i}$. It is worth mentioning that such a representation can be derived for more
general differentiable manifolds $\partial S$.
3 Analysis of insurance claims

In this section, we discuss issues arising in practical applications. In particular, we will see that,
though in the asymptotic setting the constants $k$ and $e_n$, which are chosen by the statistician, play
different roles, from a different perspective essentially only the product $ke_n$ matters.
As an example, we consider a well-known data set of claims to Danish fire insurances. The data
set contains losses to building(s), losses to contents and losses to profits (caused by the same fire)
observed in the period 01/1980 - 12/2002, discounted to 07/1985. The claims are recorded only if
the sum of all components exceeds 1 million Danish Kroner (DKK). Note that due to this recording
method, there is an artificial negative dependence between the components, since if one component
is smaller than 1 million DKK, the sum of the others must be accordingly larger. To avoid this
effect, we consider only those claims for which at least one component exceeds 1 million
DKK, which leads to a sample of 3976 claims. Moreover, we focus on the losses to buildings,
denoted by $X_i$ as a multiple of one million DKK, and the losses to contents $Y_i$. A more detailed
description of the data can be found in Müller (2008) and Drees and Müller (2008).
As described in the introduction, we assume that a quota reinsurance pays $(1-a_X)X_i$ for each
loss $X_i$ to the building and $(1-a_Y)Y_i$ for each loss $Y_i$ of content, while an XL-reinsurance pays if
the remaining costs $a_XX_i + a_YY_i$ exceed a retention level $R$. We want to estimate the probability
$p_n := P(D_n)$ with $D_n := \{a_XX_i + a_YY_i > R\}$ that a fire results in a claim to the XL-reinsurance.
(More precisely, we estimate the conditional probability given that $\max(X_i,Y_i) > 1$.)
Müller (2008), Section 5.1.2, fitted the following GPDs to the marginal distributions using the
Hill estimators based on the $k_1 = 900$ and $k_2 = 600$ largest observations:

$$F_i(x) := 1 - \Big(1 + \gamma_i\,\frac{x-\mu_i}{\sigma_i}\Big)^{-1/\gamma_i}, \qquad i = 1,2, \qquad (3.1)$$

with parameters $\gamma_1 = 0.57$, $\sigma_1 = 0.54$, $\mu_1 = 0.91$, $\gamma_2 = 0.72$, $\sigma_2 = 0.47$ and $\mu_2 = 0.15$. (These
approximations are sufficiently accurate for $x$ satisfying $1 - F_i(x) < k_i/n$.) Moreover, he showed
that both components of the claim vector are apparently asymptotically dependent.
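
To make the fitted tails in (3.1) concrete, the following minimal Python sketch evaluates the tail approximation and its reciprocal $1/(1-F_i)$. The function names and the reading of the three quoted numbers per margin as shape, scale and location are assumptions of this sketch, not notation from the paper.

```python
# Parameters quoted in the text for (3.1); reading them as (gamma, sigma, mu)
# is an assumption of this sketch.
GPD_PARAMS = {1: (0.57, 0.54, 0.91), 2: (0.72, 0.47, 0.15)}


def gpd_survival(x, i):
    """1 - F_i(x) for the fitted generalized Pareto tail of margin i."""
    gamma, sigma, mu = GPD_PARAMS[i]
    z = 1.0 + gamma * (x - mu) / sigma
    if z <= 0.0:
        raise ValueError("x lies below the support of the fitted tail")
    return z ** (-1.0 / gamma)


def tail_scale(x, i):
    """1 / (1 - F_i(x)), the standardized scale used for both margins."""
    return 1.0 / gpd_survival(x, i)


if __name__ == "__main__":
    # Losses are measured in millions of DKK, as in the text.
    for i, x in [(1, 10.0), (1, 100.0), (2, 10.0), (2, 200.0)]:
        print(f"margin {i}, x = {x}: 1/(1-F) = {tail_scale(x, i):.1f}")
```

As stated in the text, such values should only be trusted where $1 - F_i(x) < k_i/n$.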
Note that $\hat U_i^{\leftarrow} := 1/(1-F_i)$ can also be interpreted as an estimator $(n/k)\hat T_{n,i}^{\leftarrow}$ for different values
of $k$. However, the number $k$ does not have any operational meaning if one starts with a given
approximation of the marginal tails as in (3.1). In that case it seems more natural to reformulate
our estimator $\hat p_n$, the main result (2.5) and the resulting confidence interval (2.6) in terms of $\hat U^{\leftarrow}$.
First, note that the estimator of the failure probability 

depends on the constants $k$ and $e_n$ only via their product $ke_n$ (if the tail estimator $\hat U^{\leftarrow}$ is considered
fixed). At first glance, this seems peculiar, because in Theorem 2.1 the estimation error seemingly
depends on $k$ and $e_n$ in completely different ways. However, note that according to the discussion
following Theorem 2.1 for $\gamma_1,\gamma_2 > 0$, approximation (2.5) can be rewritten as

Pn-Pn = (ken)-'/' log ^Nil + Op(l)) (3.2) 

n 
for a centered Gaussian random variable $N$ with variance

ki \en^ ^2 ^e„/ \\A;e„ / \fee„ ) ) en^n 

where $I_1 := (e_n/d_n) \int_{q(\infty)}^{\infty} q^{\leftarrow}(v)\,\eta(q^{\leftarrow}(v),v)\,dv$ and $I_2 := (e_n/d_n) \int_{x_l}^{\infty} q(u)\,\eta(u,q(u))\,du$. Thus $I_i/e_n$
does not depend on $e_n$, and the distribution of the approximating Gaussian random variable on
the right-hand side of (3.2) depends on $k$ and $e_n$ only via their product.
Moreover, also the estimators 

1 1 " 

= ^•^E^c/-(x.y.)({«l-^>'^) I {u,v)e^U^{D^)]\ 

n n ^_^ n 

[{{l+ln)u,v) I {u,v)e^U^{Dn)}) 



and likewise $\hat I_{n,2}/e_n$ depend on the product $ke_n$ only. Finally, the covariance term $\nu((k_2/(ke_n),\infty) \times (k_1/(ke_n),\infty)) = k^2e_n/(\lambda k_1k_2)\,\nu((k/(\lambda k_1),\infty) \times (k/(\lambda k_2),\infty))$ can be estimated by



n 



k en ^ ( ( k \ ( k \\ ken sr^ ( ( n \ f n 

Here the choice $\lambda \in (0,1]$ ensures that $\hat U^{\leftarrow}$ is used only on the range where it is a sufficiently
accurate estimator of the true function $U^{\leftarrow}$.
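
The rescaling of the covariance term with $\lambda$ uses the homogeneity of order $-1$ of the exponent measure, $\nu(tA) = t^{-1}\nu(A)$ for $t > 0$. A short worked check follows; the symbols $t$ and $A$ are introduced here only for illustration.

```latex
\begin{align*}
t &:= \frac{\lambda k_1 k_2}{k^2 e_n}, \qquad
A := \Big(\frac{k}{\lambda k_1},\infty\Big)\times\Big(\frac{k}{\lambda k_2},\infty\Big),\\
tA &= \Big(\frac{k_2}{k e_n},\infty\Big)\times\Big(\frac{k_1}{k e_n},\infty\Big),\\
\nu(tA) &= t^{-1}\,\nu(A) = \frac{k^2 e_n}{\lambda k_1 k_2}\,
\nu\Big(\Big(\frac{k}{\lambda k_1},\infty\Big)\times\Big(\frac{k}{\lambda k_2},\infty\Big)\Big).
\end{align*}
```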

To sum up, all estimates only depend on $ke_n$, but not on the numbers $k$ and $e_n$ separately. This
product should be chosen as large as possible under the constraints that both marginal
approximations of $U_i^{\leftarrow}$ by $\hat U_i^{\leftarrow}$ and the approximation of the joint distribution of the standardized vector
(cf. (1.2)) are reliable. To ensure the former constraint, for the vast majority of the observations
$(X_i,Y_i)$, the indicator of the set $\{\hat U^{\leftarrow}(X_i,Y_i) \in n/(ke_n)\,\hat U^{\leftarrow}(D_n)\}$ should not depend on the
particular values of $\hat U_1^{\leftarrow}(X_i)$ or $\hat U_2^{\leftarrow}(Y_i)$ if these are smaller than $n/k_1$ or $n/k_2$ (either because the
other component of the vector is so large that the observations lie in the failure set anyway, or
because the other component is so small that the indicator is 0 even if the maximal value
$n/k_i$ is attained). To be more concrete, for the failure set $D_n := \{(x,y) \mid a_1x + a_2y > R\}$
introduced above, $ke_n$ should be smaller than $\min_{i=1,2} k_i\hat U_i^{\leftarrow}(R/a_i)$, because otherwise for sure
$\hat U^{\leftarrow}(x,y) \in (n/(ke_n))\,\hat U^{\leftarrow}(D_n)$ for some values $(x,y)$ for which $\hat U^{\leftarrow}(x,y)$ is not a reliable estimate of
$U^{\leftarrow}(x,y)$.
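
The crude upper bound can be evaluated directly from the fitted tails. The sketch below does this for the failure set $x + 0.5y > 100$ considered in this section; the function names and the reading of the parameters of (3.1) as shape, scale and location are assumptions of this sketch.

```python
# (gamma, sigma, mu) per margin as quoted for (3.1); this reading is an assumption.
GPD_PARAMS = {1: (0.57, 0.54, 0.91), 2: (0.72, 0.47, 0.15)}
# Numbers of largest observations used for the two marginal fits.
K_LARGEST = {1: 900, 2: 600}


def tail_scale(x, i):
    """1 / (1 - F_i(x)) for the fitted generalized Pareto tail of margin i."""
    gamma, sigma, mu = GPD_PARAMS[i]
    return (1.0 + gamma * (x - mu) / sigma) ** (1.0 / gamma)


def crude_upper_bound(retention, weights):
    """min_i k_i / (1 - F_i(R / a_i)) for the set {a_1 x + a_2 y > R}."""
    return min(
        K_LARGEST[i] * tail_scale(retention / weights[i], i) for i in (1, 2)
    )


if __name__ == "__main__":
    bound = crude_upper_bound(100.0, {1: 1.0, 2: 0.5})
    print(f"crude upper bound on k*e_n: {bound:.3g}")
```

With these inputs the minimum is attained for the second margin and is of the order $1.7 \cdot 10^6$.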

However, the above crude upper bound on $ke_n$ is not sufficient to ensure that $\hat p_n$ is a reliable estimate
of $p_n$, because the dependence structure must be accurately described by the exponent measure $\nu$,
too. We propose, in analogy to the well-known Hill plot, to plot $\hat p_n$ versus $ke_n$ and then to choose
$ke_n$ in a range where this curve seems stable. In Figure 1 such a graph is shown for the Danish
fire insurance data and the failure set $D_n = \{(x,y) \mid x + 0.5y > 100\}$ for values of $ke_n$ ranging
from 10^ to 5 · 10^. Note that the aforementioned crude upper bound on $ke_n$ is about $1.7 \cdot 10^6$, but
the curve of probability estimates shows a clear downward trend for $ke_n$ > 2 · 10^, which is most
likely due to a deviation of the dependence structure from its limit. On the other hand, for values
smaller than 5 · 10^ the curve is very unstable, too, because the random error is still too large as
just a few observations fall into the inflated failure set (e.g., about 25 if $ke_n$ ≈ 3 · 10^). This lower
bound on $ke_n$ reflects the condition in the asymptotic framework that $n$ is of smaller order than
$ke_n$ (see condition (S1)). In view of this plot, the choice $ke_n$ = 2 · 10^ seems reasonable.

Figure 1: $\hat p_n$ (solid blue line) and confidence intervals (black dashed line) versus $ke_n$ for Danish
fire insurance claims
In addition, Figure 1 shows a two-sided confidence interval with nominal size 0.95 again as a
function of $ke_n$. Here we have chosen $\iota_n = 0.1$ and $\lambda = 1$ in the estimator of the variance $\hat\sigma_n^2$
described above; other values of $\lambda$ between 1/2 and 1 yield essentially the same estimates, while
smaller values of $\iota_n$ lead to larger fluctuations in the confidence bounds, which however are still of
a similar size.

For $ke_n$ = 2 · 10^ one obtains a point estimate for $p_n$ of about $8.8 \cdot 10^{-4}$ and a confidence interval
$[2.2 \cdot 10^{-4}, 1.54 \cdot 10^{-3}]$. At first glance, this confidence interval seems rather wide. However, one has
to be aware of the fact that we estimate the probability of a very rare event which has occurred
only twice in the observational period of more than 20 years. Indeed the empirical probability
of the event is about $5 \cdot 10^{-4}$, and the Clopper-Pearson confidence interval $[6 \cdot 10^{-5}, 1.8 \cdot 10^{-3}]$
(again with nominal size 0.95) is even wider. It is worth mentioning that both the empirical point
estimate and the Clopper-Pearson confidence interval are exactly the same if one wants to estimate
the probability that a claim occurs to the XL-reinsurance for any retention level $R$ between 77 and
145 million DKK! Moreover, for retention levels above 152 million DKK the point estimate would
be 0 and thus useless for purposes of risk management.
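
The empirical benchmark can be recomputed from the two counts given in the text (2 events among 3976 claims). The following is a minimal standard-library sketch of the Clopper-Pearson interval obtained by bisection on the binomial distribution function; the helper names are illustrative only.

```python
import math


def binom_cdf(x, n, p):
    """P(Bin(n, p) <= x), summed directly (adequate for small x)."""
    return sum(
        math.comb(n, j) * p**j * (1.0 - p) ** (n - j) for j in range(x + 1)
    )


def solve(f, lo, hi, tol=1e-12):
    """Bisection for the root of a monotone function f on [lo, hi]."""
    f_lo = f(lo)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if (f(mid) > 0) == (f_lo > 0):
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)


def clopper_pearson(x, n, alpha=0.05):
    """Two-sided exact interval for a binomial proportion."""
    if x == 0:
        lower = 0.0
    else:
        # Lower bound: P(Bin(n, p) >= x) = alpha / 2.
        lower = solve(lambda p: 1.0 - binom_cdf(x - 1, n, p) - alpha / 2, 0.0, 1.0)
    if x == n:
        upper = 1.0
    else:
        # Upper bound: P(Bin(n, p) <= x) = alpha / 2.
        upper = solve(lambda p: binom_cdf(x, n, p) - alpha / 2, 0.0, 1.0)
    return lower, upper


if __name__ == "__main__":
    events, claims = 2, 3976
    print("empirical probability:", events / claims)
    print("Clopper-Pearson interval:", clopper_pearson(events, claims))
```

The interval depends on the data only through the count of exceedances, which is why it is identical for every retention level that is exceeded by the same two claims.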



4 Proofs 



The proof of Theorem 2.1 is based on the decomposition (1.15) of the estimation error. The
asymptotic behavior of the leading term IV + V is established in Corollary 4.5. As a preparation
for this result, first we establish an approximation of the random transformation of the marginals
defined in (1.12). Thereby we must restrict ourselves to arguments that are neither too small nor
too large, which defines a certain subset $S_n^*$ of $S$. In Lemma 4.2 an upper bound on the difference
between the $\nu$-measures of $S$ and $S_n^*$ is derived, while the Lemmas 4.3 and 4.4 analyze the influence
of the marginal transformations on the $\nu$-measure of $S_n^*$.

In Lemma 4.6 an upper bound on the term II of decomposition (1.15) is proved using empirical
process theory. Finally, Lemma 4.8 establishes upper bounds on the terms I and III, while Lemma
4.9 takes care of VI.



Lemma 4.1. Assume that the conditions (M1)-(M3) and (S1) are fulfilled. For $i \in \{1,2\}$, let
$\lambda_{n,i} > 0$ be a decreasing and $\tau_{n,i} < \infty$ an increasing sequence, such that the following conditions
are met:

(i) $A_i(n/k)(\lambda_{n,i}d_nk/n)^{\rho_i+\varepsilon} = o(k^{-1/2}w_n(\gamma_i))$ for some $\varepsilon > 0$

(ii) If $\gamma_i > 0$, then $k^{-1/2} = o((\lambda_{n,i}d_n/e_n)^{\gamma_i})$.

(iii) If $\gamma_i < 0$, then $k^{-1/2} = o((\tau_{n,i}d_nk/n)^{\gamma_i})$ and $\log(d_n/e_n) = o((d_nk/n)^{-\gamma_i})$

(iv) If $\gamma_i = 0$, then $k^{-1/2}\log\tau_{n,i} \to 0$ and $\log(d_n/e_n) = o(\log c_n)$

Then, for $i \in \{1,2\}$,

-^Hndix) = T^i o TnA o T'^^i^ ° Ui{dnX) 



e, 



^ , I -k~^/^\ogCn{^ + Op{l)) + Op{k-^l\xdn/en)-^^), 7i > 

— X ( 1 + <j k-^l^{dnk/n)-^^ {{aih, - ft - r,/72 + op(l))x-^» + op(l)) , 7i < 
-k-y^\og^Cn{Tj2 + op{l)) + Op{k-^lnogCn\ogx), 7i = 



-n 



uniformly for x G [A„,i, r„,j]. 

Proof. For notational simplicity, we omit all indices and arguments of the marginal parameters and
normalizing functions and their estimators; e.g., we use $a$ as a short form of $a_i(n/k)$. Moreover,
we drop all indices referring to the $i$th marginal, i.e., we write $U$ instead of $U_i$, $T_n$ instead of $T_{n,i}$
and so on.
By (ll^D, for all < e < \p\, 

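$$\Delta_1(x) := \frac{U(d_nx)-b}{a} - \frac{(xd_nk/n)^{\gamma}-1}{\gamma} = O\big(A(n/k)(xd_nk/n)^{\gamma+\rho+\varepsilon}\big) = o\big(k^{-1/2}w_n(\gamma)(xd_nk/n)^{\gamma}\big) \qquad (4.1)$$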

uniformly for all x > A„, where in the last step we have used condition (i). Now one can conclude 

that U{dnx) G r„((0, oo)) for all x G [A„, r„] with probability tending to 1. For example, if 7 > 0, 

then we have to show that C/(d„x) > b — a/7 for all x > Xn or, equivalently, (using (M3)) that 

A,(A.) 11^ i^ _ A _ i^ndnk/nr - 1 ^ Udnk^y ^ ^(.-l/^) 

a 07 7 7 V n / 

which follows immediately from (4.1), (S1) and (ii).
Hence

r„^of„ofW^oC/(d„a;) 



l + ^(a^^^^ 2 L L + 6-6 

a V 7 



1/7 



=: i/fa;) 



if the expression in brackets is strictly positive, which will indeed follow from the calculations
below.

Now direct calculations show that 

H{x) 
By assumption (M3) 



1+7 c„^ + (- —7 



7 



a a 



1/7 



(4.2) 



a a 
If 7 > 0, then the Taylor expansion 



A2 := - - ^-^7 - 1 = k-^^\c, - 7/3 + op(l)). 



(4.3) 



Cn^/^ = C-' (l - fc-i/2_ log c„ + op(A:-i/2 log c„)) 
together with (jiJ]) . (jOj) and (SI) imphes 



^(x) 



d„A; \7 



n 



^ n 



7, 



7 7 



1 + 7Ai(x) + 4(1 + As) + 1 - 4 - 4A2 
7 / 7 7 

d„k \ -7N 



1/7 



l + Op((|A,(x)| + ^-V2)(^,)-^)+Op(,-V24(^ 



(Xr) fC \ / 



1/7 



= ^Jl- k-'/'- logCn + Op(A;-l/2 logC„; 

en ^ 7 

from which the assertion follows readily. 
If 7 < 0, then similar arguments prove 



1 + op(A;- 1/2 log c„) + Op (k-y^ f^x' ' 



H{x) = ^x{l + Op{k-'/^logCn)) 



i+k-'/2}_U-jp--+op{i))(^xY\op{k-y'wnm) 

7 V 7 / \ n / 



and hence the assertion, because the assumption (iii) ensures that $\log c_n = o(w_n(\gamma))$.
Finally, for $\gamma = 0$, the Taylor expansion



c„^ = 1 - 7 log c„ + -7^ log2 c„ + Op (7=^ log^ Cn) 



yields 



H(x) 



exp 



:i-7logc„ + Op(fc-ilog2c„))(log(^x)+Ai(x)) + 
1 



+ ( - log C„ + -7 log2 C„ + Op{k-^ log3 Cn)) (1 + A2) 
fl!r),/v 



CnU 
dn 



-xexp 



7 log Cn ( log C„ + log ( —x] ] +Op{k ^/^ log^ Cn) + -7 log^ Cn 



l--{T + Op{l))k-^l^\og^Cn + Op{k'^IHogCn\ogx) 



which concludes the proof. □

In what follows we denote by $\lambda_{n,1} \downarrow x_l$, $\lambda_{n,2} \downarrow q(\infty) := \lim_{x\to\infty} q(x)$ and $\tau_{n,i} \uparrow \infty$, $i \in \{1,2\}$,
sequences which satisfy the conditions of Lemma 4.1. (These sequences will be specified in the
proof of Corollary 4.5.) Note that in particular constant sequences $\lambda_{n,i},\tau_{n,i} \in (0,\infty)$ satisfy the
conditions of Lemma 4.1 provided

$$A_i(n/k)\,c_n^{\rho_i+\varepsilon} = o(k^{-1/2}w_n(\gamma_i)) \quad \text{for } i \in \{1,2\} \text{ and some } \varepsilon > 0 \qquad (4.4)$$

and (S1) holds. Therefore we may and will choose

$\lambda_{n,1} = x_l$ if $x_l := \inf\{x > 0 \mid q(x) < \infty\} > 0$,
$\lambda_{n,2} = q(\infty)$ if $q(\infty) > 0$.

We want to apply the approximations just established to points $(x,y)$ on the boundary of $S$. To
ensure that $x \in [\lambda_{n,1},\tau_{n,1}]$ and $y \in [\lambda_{n,2},\tau_{n,2}]$, we consider a subset $S_n^*$ of $S$ that is bounded away
from the coordinate axes. More precisely, we define

$$S_n^* := S \cap ([u_n^*,\infty) \times [v_n^*,\infty))$$

with

$$u_n^* := \lambda_{n,1} \vee q^{\leftarrow}(\tau_{n,2}), \qquad v_n^* := \lambda_{n,2} \vee q(\tau_{n,1}).$$

The following lemma implies that the $\nu$-measure of the set $S \setminus S_n^*$ is asymptotically negligible.
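Lemma 4.2.

$$\nu(S) - \nu(S_n^*) = O\Big(\frac{\lambda_{n,1}-x_l}{q^2(\lambda_{n,1}-)} + \frac{q(\tau_{n,1})-q(\infty)}{\tau_{n,1}^2} + \frac{\lambda_{n,2}-q(\infty)}{(q^{\leftarrow}(\lambda_{n,2}-))^2} + \frac{q^{\leftarrow}(\tau_{n,2})-x_l}{\tau_{n,2}^2}\Big)$$

with $q(x-) := \lim_{t\uparrow x} q(t)$.

Proof. First note that $S \subset [x_l,\infty) \times [q(\infty),\infty)$ implies

$$\nu(S) - \nu\big(S \cap ([0,u_n^*) \times [0,\infty))^c\big) \le \nu\big([x_l,\lambda_{n,1}) \times [q(\lambda_{n,1}-),\infty)\big) + \nu\big([x_l,q^{\leftarrow}(\tau_{n,2})) \times [\tau_{n,2},\infty)\big).$$

The spectral density $\varphi$ is assumed continuous and hence it is bounded. From (2.4) we conclude
that for arbitrary $0 \le u_0 < u_1$ and $v_0 > 0$

$$\nu([u_0,u_1) \times [v_0,\infty)) = O\Big(\int_{u_0}^{u_1}\int_{v_0}^{\infty} (u^2+v^2)^{-3/2}\,dv\,du\Big) = O\Big(\frac{u_1-u_0}{v_0^2}\Big)$$

and thus

$$\nu(S) - \nu\big(S \cap ([0,u_n^*) \times [0,\infty))^c\big) = O\Big(\frac{\lambda_{n,1}-x_l}{q^2(\lambda_{n,1}-)} + \frac{q^{\leftarrow}(\tau_{n,2})-x_l}{\tau_{n,2}^2}\Big).$$

Likewise, one can show that

$$\nu\big(S \cap ([0,u_n^*) \times [0,\infty))^c\big) - \nu(S_n^*) = O\Big(\frac{\lambda_{n,2}-q(\infty)}{(q^{\leftarrow}(\lambda_{n,2}-))^2} + \frac{q(\tau_{n,1})-q(\infty)}{\tau_{n,1}^2}\Big).$$

A combination of these two bounds yields the assertion. □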
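
The order bound for the $\nu$-measure of a half-infinite rectangle used in this proof can be checked in two lines. The computation below assumes, as in the proof, that the density of $\nu$ is bounded by a constant multiple of $(u^2+v^2)^{-3/2}$.

```latex
\begin{align*}
\int_{v_0}^{\infty} (u^2+v^2)^{-3/2}\,dv
  &\le \int_{v_0}^{\infty} v^{-3}\,dv = \frac{1}{2v_0^2}
  \qquad \text{for all } u \ge 0,\ v_0 > 0, \\
\int_{u_0}^{u_1}\int_{v_0}^{\infty} (u^2+v^2)^{-3/2}\,dv\,du
  &\le \frac{u_1-u_0}{2v_0^2}.
\end{align*}
```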
On the set $S_n^*$ we can now use the approximation from Lemma 4.1 to first examine the influence of
the transformation $H_{n,2}$ of the second coordinate on the $\nu$-measure of $S_n^*$. In a second step we then
similarly determine how the $\nu$-measure of this transformed set is altered by the transformation
$H_{n,1}$ of the first coordinate. Note that by Lemma 4.1 the marginal transformations are
invertible with probability tending to 1.

Lemma 4.3. Let Hn{x,y) := {Hn,i{x),Hn,2{y)) ■= ^T^ofnof^f^'^oU{dnX,dny)- Suppose that 
the conditions (D2) and (Ql) are met. 

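Then one has with $q_n(u) := q(u) \vee v_n^*$ and $q_n^{\leftarrow}(v) := q^{\leftarrow}(H_{n,2}^{\leftarrow}(v)) \vee u_n^*$

$$\nu(H_n(S_n^*)) - \nu(S_n^*) + \int_{u_n^*}^{\infty} \big(H_{n,2}(q_n(u)) - q_n(u)\big)\,\eta(u,q_n(u))\,du + \int_{H_{n,2}(v_n^*)}^{\infty} \big(H_{n,1}(q_n^{\leftarrow}(v)) - q_n^{\leftarrow}(v)\big)\,\eta(q_n^{\leftarrow}(v),v)\,dv$$

$$= o\Big(\int_{u_n^*}^{\infty} \big|H_{n,2}(q_n(u)) - q_n(u)\big|\,\eta(u,q_n(u))\,du + \int_{H_{n,2}(v_n^*)}^{\infty} \big|H_{n,1}(q_n^{\leftarrow}(v)) - q_n^{\leftarrow}(v)\big|\,\eta(q_n^{\leftarrow}(v),v)\,dv\Big) \qquad (4.6)$$

with probability tending to 1.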




Figure 2: The light and mid grey regions show the approximation $S_n^*$ of the set $S$, the mid and the
dark grey regions the symmetric difference between $\{(x,H_{n,2}(y)) \mid (x,y) \in S_n^*\}$ and $S_n^*$, where the
dark grey region is counted with a positive sign, the mid grey region with a negative sign. (Here
it is assumed that $u_n^* = q^{\leftarrow}(\tau_{n,2})$.)

Proof. According to the proof of Lemma 4.1, for all $\delta \in (0,1)$, on the set $[\lambda_{n,i}(1-\delta),\tau_{n,i}(1+\delta)]$ the
transformation $H_{n,i}$ is continuous and strictly increasing and $H_{n,i}(x) = x(1+o(1))$ with probability
tending to 1.
We first quantify the influence of the transformation of the second coordinate. Note that

$$\nu(S_n^*) = \int_{u_n^*}^{\infty}\int_{q_n(u)}^{\infty} \eta(u,v)\,dv\,du,$$

$$\nu\{(x,H_{n,2}(y)) \mid (x,y) \in S_n^*\} = \int_{u_n^*}^{\infty}\int_{H_{n,2}(q_n(u))}^{\infty} \eta(u,v)\,dv\,du$$

and hence

$$\nu\{(x,H_{n,2}(y)) \mid (x,y) \in S_n^*\} - \nu(S_n^*) = -\int_{u_n^*}^{\infty}\int_{q_n(u)}^{H_{n,2}(q_n(u))} \eta(u,v)\,dv\,du. \qquad (4.7)$$

The inner integral equals

$$\int_{q_n(u)}^{H_{n,2}(q_n(u))} \eta(u,v)\,dv = \big(H_{n,2}(q_n(u)) - q_n(u)\big)\,\eta(u,q_n(u)) + \int_1^{H_{n,2}(q_n(u))/q_n(u)} \Big(\frac{\eta(u,q_n(u)w)}{\eta(u,q_n(u))} - 1\Big)\,dw\;\eta(u,q_n(u))\,q_n(u). \qquad (4.8)$$

By the assumptions and Lemma 4.1, $H_{n,2}(q_n(u))/q_n(u) \to 1$ uniformly for $u \in (u_n^*,\infty)$ as $q_n(u) \in
[\lambda_{n,2},\tau_{n,2}]$ for $u > u_n^*$. Recall that it is assumed that the spectral measure has density $\varphi$ which
is continuous and strictly positive so that $\varphi$ is uniformly continuous and bounded away from 0.
Thus, by (1121), 

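$$\frac{\eta(u,vw)}{\eta(u,v)} = \Big(\frac{u^2+v^2}{u^2+v^2w^2}\Big)^{3/2}\,\frac{\varphi(\arctan\frac{vw}{u})}{\varphi(\arctan\frac{v}{u})} \to 1$$

as $w \to 1$ uniformly for $u,v > 0$, since

$$\frac{u^2+v^2}{u^2+v^2w^2} = 1 + \frac{1-w^2}{(u/v)^2+w^2} \to 1$$

and

$$\Big|\frac{\varphi(\arctan\frac{vw}{u})}{\varphi(\arctan\frac{v}{u})} - 1\Big| \le \frac{\big|\varphi(\arctan\frac{vw}{u}) - \varphi(\arctan\frac{v}{u})\big|}{\inf_{0\le\vartheta\le\pi/2}\varphi(\vartheta)} \to 0$$

as $w \to 1$ uniformly for $u,v > 0$ by the uniform continuity of $\varphi$ and

$$\Big|\arctan\frac{vw}{u} - \arctan\frac{v}{u}\Big| = \frac{(v/u)\,|w-1|}{1+\big((v/u)(1+\theta(w-1))\big)^2} \le |w-1|\,\sup_{t>0}\frac{t}{1+(t/2)^2}$$

for $w > 1/2$ and some $\theta \in (0,1)$, which holds by the mean value theorem.
Therefore,

$$\int_1^{H_{n,2}(q_n(u))/q_n(u)} \Big(\frac{\eta(u,q_n(u)w)}{\eta(u,q_n(u))} - 1\Big)\,dw = o\big(H_{n,2}(q_n(u))/q_n(u) - 1\big)$$

which, combined with (4.7) and (4.8), yields

$$\nu\{(x,H_{n,2}(y)) \mid (x,y) \in S_n^*\} - \nu(S_n^*) + \int_{u_n^*}^{\infty} \big(H_{n,2}(q_n(u)) - q_n(u)\big)\,\eta(u,q_n(u))\,du = o\Big(\int_{u_n^*}^{\infty} \big|H_{n,2}(q_n(u)) - q_n(u)\big|\,\eta(u,q_n(u))\,du\Big). \qquad (4.9)$$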



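One can derive an analogous approximation of the difference between $\nu\{(x,H_{n,2}(y)) \mid (x,y) \in S_n^*\}$
and $\nu\{(H_{n,1}(x),H_{n,2}(y)) \mid (x,y) \in S_n^*\}$ by similar arguments if one interchanges the order of
integration:

$$\nu(H_n(S_n^*)) - \nu\{(x,H_{n,2}(y)) \mid (x,y) \in S_n^*\} + \int_{H_{n,2}(v_n^*)}^{\infty} \big(H_{n,1}(q_n^{\leftarrow}(v)) - q_n^{\leftarrow}(v)\big)\,\eta(q_n^{\leftarrow}(v),v)\,dv = o\Big(\int_{H_{n,2}(v_n^*)}^{\infty} \big|H_{n,1}(q_n^{\leftarrow}(v)) - q_n^{\leftarrow}(v)\big|\,\eta(q_n^{\leftarrow}(v),v)\,dv\Big). \qquad (4.10)$$

Summing up (4.9) and (4.10), we arrive at the assertion. □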



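In the next lemma, we calculate the limits of the integrals arising in Lemma 4.3 using the
approximation established in Lemma 4.1.

Lemma 4.4. Suppose that the conditions of Lemma 4.3 and, in addition, the following conditions
are fulfilled for some $x_0 \in (x_l,q^{\leftarrow}(q(\infty)))$, $y_0 \in (q(\infty),q(x_l))$:

$$\int_{y_0}^{\infty} (q^{\leftarrow}(v))^{1-\gamma_1}v^{-3}\,dv < \infty \quad \text{or} \quad \lambda_{n,1}^{1-\gamma_1} = o(\log c_n) \qquad (4.11)$$

$$\int_{x_0}^{\infty} (q(u))^{1-\gamma_2}u^{-3}\,du < \infty \quad \text{or} \quad \lambda_{n,2}^{1-\gamma_2} = o(\log c_n) \qquad (4.12)$$

Then the following approximations hold true:

(i)
$$\frac{k^{1/2}}{w_n(\gamma_2)} \int_{u_n^*}^{\infty} \big(H_{n,2}(q_n(u)) - q_n(u)\big)\,\eta(u,q_n(u))\,du \longrightarrow \begin{cases} -\frac{\Gamma_2}{\gamma_2} \int_{x_l}^{\infty} q(u)\,\eta(u,q(u))\,du, & \gamma_2 > 0 \\ \big(\frac{\alpha_2}{\gamma_2} - \beta_2 - \frac{\Gamma_2}{\gamma_2^2}\big) \int_{x_l}^{\infty} (q(u))^{1-\gamma_2}\,\eta(u,q(u))\,du, & \gamma_2 < 0 \\ -\frac{\Gamma_2}{2} \int_{x_l}^{\infty} q(u)\,\eta(u,q(u))\,du, & \gamma_2 = 0 \end{cases}$$

Moreover,

$$\int_{u_n^*}^{\infty} \big|H_{n,2}(q_n(u)) - q_n(u)\big|\,\eta(u,q_n(u))\,du = O_P(k^{-1/2}w_n(\gamma_2)).$$

(ii)
$$\frac{k^{1/2}}{w_n(\gamma_1)} \int_{H_{n,2}(v_n^*)}^{\infty} \big(H_{n,1}(q_n^{\leftarrow}(v)) - q_n^{\leftarrow}(v)\big)\,\eta(q_n^{\leftarrow}(v),v)\,dv \longrightarrow \begin{cases} -\frac{\Gamma_1}{\gamma_1} \int_{q(\infty)}^{\infty} q^{\leftarrow}(v)\,\eta(q^{\leftarrow}(v),v)\,dv, & \gamma_1 > 0 \\ \big(\frac{\alpha_1}{\gamma_1} - \beta_1 - \frac{\Gamma_1}{\gamma_1^2}\big) \int_{q(\infty)}^{\infty} (q^{\leftarrow}(v))^{1-\gamma_1}\,\eta(q^{\leftarrow}(v),v)\,dv, & \gamma_1 < 0 \\ -\frac{\Gamma_1}{2} \int_{q(\infty)}^{\infty} q^{\leftarrow}(v)\,\eta(q^{\leftarrow}(v),v)\,dv, & \gamma_1 = 0 \end{cases}$$

Furthermore,

$$\int_{H_{n,2}(v_n^*)}^{\infty} \big|H_{n,1}(q_n^{\leftarrow}(v)) - q_n^{\leftarrow}(v)\big|\,\eta(q_n^{\leftarrow}(v),v)\,dv = O_P(k^{-1/2}w_n(\gamma_1)).$$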
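Proof. ad (i): Because the spectral density $\varphi$ is bounded, there exists a constant $K > 0$ such
that

$$\eta(u,q(u)) \le K\big(u^2 + (q(u))^2\big)^{-3/2} \le K\big(u^{-3} \wedge (q(u))^{-3}\big) \quad \forall u > 0. \qquad (4.13)$$

Hence $q_n(u)\,\eta(u,q_n(u)) \le K(q(u))^{-2}$ for $u \in [x_l,x_0]$ and $n$ sufficiently large and $q_n(u)\,\eta(u,q_n(u)) \le
Kq(x_0)u^{-3}$ for $u > x_0$. Therefore,

$$\lim_{n\to\infty} \int_{u_n^*}^{\infty} q_n(u)\,\eta(u,q_n(u))\,du = \int_{x_l}^{\infty} q(u)\,\eta(u,q(u))\,du < \infty \qquad (4.14)$$

by the dominated convergence theorem and $u_n^* \downarrow x_l$.

Now, we distinguish three cases.

If $\gamma_2 > 0$, then by Lemma 4.1 and $d_n \asymp e_n$

$$\frac{k^{1/2}}{\log c_n} \int_{u_n^*}^{\infty} \big(H_{n,2}(q_n(u)) - q_n(u)\big)\,\eta(u,q_n(u))\,du = -\Big(\frac{\Gamma_2}{\gamma_2} + o_P(1)\Big) \int_{u_n^*}^{\infty} q_n(u)\,\eta(u,q_n(u))\,du + O_P\Big(\frac{1}{\log c_n} \int_{u_n^*}^{\infty} (q_n(u))^{1-\gamma_2}\,\eta(u,q_n(u))\,du\Big).$$

Because of (4.13) and (4.12)

$$\int_{u_n^*}^{\infty} (q_n(u))^{1-\gamma_2}\,\eta(u,q_n(u))\,du \le K(q(x_0))^{-2-\gamma_2}(x_0-x_l) + K \int_{x_0}^{\infty} (q(u) \vee \lambda_{n,2})^{1-\gamma_2}u^{-3}\,du = o(\log c_n). \qquad (4.15)$$

Hence, in view of (4.14), we have

$$\int_{u_n^*}^{\infty} \big(H_{n,2}(q_n(u)) - q_n(u)\big)\,\eta(u,q_n(u))\,du = -k^{-1/2}\log c_n\,\frac{\Gamma_2}{\gamma_2} \int_{x_l}^{\infty} q(u)\,\eta(u,q(u))\,du + o_P(k^{-1/2}\log c_n).$$

If $\gamma_2 < 0$, then the assertion follows similarly from Lemma 4.1 and (4.13).
Finally, in the case $\gamma_2 = 0$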

f°° \logq(x)\ r°° 

/ qn{u)\logqn{u)\ri{u,qn{u))du < K sup ———-^ + K snp q{x)\logq{x)\ u^^ du < 00. 

Jxi x<xo [Q\^)) x>xo Jxo 

(4.16) 
Hence, similarly as in the first case, we may conclude the assertion from Lemma 4.1.

ad (ii): The second assertion can be proved in a very similar fashion using q{Hn^2{Vn)) ~^ Qi^o) 
and the fact that q^{u) — )■ q'^{u) for Lebesgue-almost all u > q{oo), because of Lemma F4.1l and the 
Lebesgue-almost surely continuity of q^. For that reason, we only give the analog to the bound 
(|4.15p for the integral under consideration in the case 71 > 0. 
For 2/0 £ (9(00), ^(a;;)) and all sufficiently large n, we have 

ryo 
/ {qt{v))'-^'v{qtiv),v) dv < K{qt{yo)r'-^'{yo " ^(oo)) = 0(1). 

If 71 ^ 1 5 then 

/"OO POO 

{qt{v))^-'"^[qt{v),v) dv < K{qt{yo)f~^' / v'^ dv = 0(1). 



yo -Jyo 
Finally, if $\gamma_1 > 1$, then by the monotonicity of $q^{\leftarrow}$ and the asymptotic behavior of $H_{n,2}$ we have
for all $\delta > 0$ and sufficiently large $n$



yo -Jyo 

00 



= 0( {q^{v))'-^^v-'dvAXl-,^')=o{logCn) 

^ Jyo{i+S) ' 

by condition (|iTTT1) . D 

The following result gives sufficient conditions such that the difference between the $\nu$-measure of
$S$ and of the truncated set after the marginal transformations (i.e. $d_n(IV + V)$ in (1.15)) can be
approximated by the limiting terms in Lemma 4.4. For the sake of simplicity, we assume that
$d_n$ and $e_n$ are of the same order, but it is not difficult to prove similar results under weaker
conditions on $d_n/e_n$. Moreover, one can weaken the condition (S2) and the assumptions (Q2)
could be replaced with rather strong conditions on the rate at which $k$ tends to $\infty$.

Corollary 4.5. // the conditions (M1)~(M3), (D2), (Ql), (Q2) and (S1)^(S3) are fulfilled, then 
u{Hn{Sl)) - u{S) 



k-'/^WnM 



+ k-^/^Wnh2) 



;t - ^1 - ^) I,°^^){q''{v)y-'^7^{q^{v),v) dv, 71 < 

-ri/g7oo)^^(^)^(«^(^)''^)^'^' 71=0 

-^ LT Qi^)v{u, q{u)) du, 72 > 

(02 



h - ^) !;^{q{u)Y-'-^{u,q{u))du, 72 < 
"^2 !^ q{u)r]{u, q{u)) du, 72 = 

+ Op{k-^/\wni7l)VWni72))) (4.17) 

Proof. In view of the Lemmas 4.2 - 4.4, it suffices to define sequences $\lambda_{n,i}$ and $\tau_{n,i}$, $i \in \{1,2\}$, such
that the conditions (i)-(iv) of Lemma 4.1 and (4.11) and (4.12) are fulfilled and

$$\frac{\lambda_{n,1}-x_l}{q^2(\lambda_{n,1}-)} + \frac{q(\tau_{n,1})-q(\infty)}{\tau_{n,1}^2} = o(k^{-1/2}w_n(\gamma_1)), \qquad \frac{\lambda_{n,2}-q(\infty)}{(q^{\leftarrow}(\lambda_{n,2}-))^2} + \frac{q^{\leftarrow}(\tau_{n,2})-x_l}{\tau_{n,2}^2} = o(k^{-1/2}w_n(\gamma_2)).$$

Note that we can check these conditions for $i = 1$ and $i = 2$ separately. We focus on the sequences
$\lambda_{n,1}$ and $\tau_{n,1}$, since the case $i = 2$ can be treated analogously if $x_l$ is replaced with $q(\infty)$ and $q$
with $q^{\leftarrow}$. Again we distinguish three cases depending on the sign of $\gamma_1$.

If $\gamma_1 > 0$, then $\tau_{n,1}$ must only satisfy $(q(\tau_{n,1}) - q(\infty))/\tau_{n,1}^2 = o(k^{-1/2}\log c_n)$, which can easily be
fulfilled by letting $\tau_{n,1}$ tend to $\infty$ sufficiently fast.

The sequence $\lambda_{n,1}$ has to satisfy the conditions (i) and (ii) of Lemma 4.1, (4.11) and
$(\lambda_{n,1} - x_l)/q^2(\lambda_{n,1}) = o(k^{-1/2}\log c_n)$. If $x_l > 0$, then $\lambda_{n,1} = x_l$ does the job, because condition (i) of
Lemma 4.1 is implied by (S2).

If $x_l = 0$ and $\gamma_1 < 1$, then the integrability condition of (4.11) is trivial. Moreover,
$\lambda_{n,1} := k^{-1/2}(\log c_n)^{1/2} \to 0$ obviously fulfills 4.1 (ii) and $(\lambda_{n,1} - x_l)/q^2(\lambda_{n,1}) = O(\lambda_{n,1}) = o(k^{-1/2}\log c_n)$.
Condition 4.1 (i) follows from (S2) and (S3), which implies $c_n\lambda_{n,1} \to \infty$.
Finally, if $x_l = 0$ and $\gamma_1 > 1$, then $\lambda_{n,1} := (k^{-1/2}\log c_n)^{1/\gamma_1}$ fulfills 4.1 (ii), 4.1 (i) follows from (S2)
and (S3) as above, and (Q2) implies

t{K,i) V|logA„,i|V V |log(A;-V2logc„,)|2y 

by (SI). Furthermore, the integrability condition of (|4.1ip is fulfilled, because (Q2) implies 

(?;/logt;)2/(i-Ti) = 0{q^{v)) as v -;> 00. 

Next we consider the case —1/2 < 71 < 0, when the integrability condition of (j4.1ip is trivial. If 
xi > 0, then we can argue as above that A„_i = xi satisfies all conditions on An,i. If xi = 0, then 
define A^^i = c~^'fn for some (/9„ — >■ 00 sufficiently slowly, so that 14. II (i) follows from (S2). Further 
{^n,i — xi)/q^[\n,i) = 0{c~^ipn) = o{k~^''^Cn^^) follows from assumption (S3). 
The conditions on r„^i read as ((?(t„_i) — g(oo))/r^]^ = o{k~^''^Cn'^^) and k"^''^ = o((c„t„^i)'''^) in 
this case, which are fulfilled by r„^i = k^''^o}i — > 00. 

In the case 71 = the integrability condition of (|4.1ip is again trivial and A„_i = x; if xi > 0, and 
•^n,i = c~^logc„ if xi = does the job. Moreover, it is easily checked that Tn,i = k^'^ satisfies 
(?(■'"«, 1) - Q{^))/'^n,i = o{k~^^'^ log^ c„) and condition 14.11 (iv). D 

Observe that we have verified stronger conditions on $\lambda_{n,i}$ and $\tau_{n,i}$ than actually necessary, if
$w_n(\gamma_1) = o(w_n(\gamma_2))$. A refined analysis would lead to weaker, but more complex conditions on
$q$ and $k$ that depend on both the values of $\gamma_1$ and $\gamma_2$ at the same time. (Also the proof would
become more lengthy as one had to consider 9 cases arising from different combinations of signs
of $\gamma_1$ and $\gamma_2$.) Moreover, note that for the above choice of $\lambda_{n,i}$ one has

$$c_n\lambda_{n,i} \to \infty, \qquad i \in \{1,2\}, \qquad (4.18)$$

and

$$\lambda_{n,i}^{-\gamma_i} = O(k^{1/2}/\log c_n) \quad \text{if } \gamma_i > 0, \qquad i \in \{1,2\}. \qquad (4.19)$$

Now we use classical empirical process theory to establish a uniform bound on $\nu_n(B) - E\nu_n(B)$
and thus on term II in decomposition (1.15).

Lemma 4.6. Under the conditions of Theorem 2.1 one has

$$\big(\nu_n(B) - E\nu_n(B)\big)\big|_{B=(d_n/e_n)H_n(S_n^*)} = o_P\big(k^{-1/2}(w_n(\gamma_1) \vee w_n(\gamma_2))\big).$$



Proof. Note that by (14. 2p one has 

:i/„(x„X2)=(^^;i;,,,(xO,^S,6M 
for (a;i,a;2) G [u^,oo) x [t'i^,oo) with -ff^"'*. f. defined by (|2.3p and 

_^. _ ^. _ ^1/2.^. _ ^.^ .. _ .1/2 ( a^{n/k) kju/k) - hjn/k) \ 

Since, according to condition (M3), these random variables are stochastically bounded, it suffices
to prove that for all $M > 0$


sup 

'^M\H^X^l\(i^\)<M 



-"«ix.«.).^i,J - ^-"(^(S.x.C.).^i,J = op{k-"\w^{l,) V ^„(72))) 
where

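Letting $\theta := (\vartheta_i,\chi_i,\xi_i)_{i=1,2}$ and

we have to prove that $Z_n$ tends to 0 in probability uniformly. To this end, we establish asymptotic
equicontinuity of $Z_n$, i.e.

$$\lim_{\delta\downarrow 0}\limsup_{n\to\infty} P\Big\{\sup_{\theta,\psi\in[-M,M]^6,\ \|\theta-\psi\|_\infty\le\delta} |Z_n(\theta) - Z_n(\psi)| > \eta\Big\} = 0 \quad \forall \eta > 0, \qquad (4.20)$$

and convergence in probability of $Z_n(\theta)$ for all $\theta \in [-M,M]^6$ (see van der Vaart and Wellner, 2000,
Theorem 1.5.7).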

For the proof of asymptotic equicontinuity, it is crucial that the functions $H^{(\vartheta_i,\chi_i,\xi_i)}_{n,i}(x_i)$ are
decreasing in all three parameters for all $(x_1,x_2) \in [u_n^*,\infty) \times [v_n^*,\infty)$. For $\xi_i$ resp. $\vartheta_i$ this monotonicity is
an immediate consequence of the facts that $(c_n^{-\gamma} - 1)/\gamma$ is negative and increasing in $\gamma$ (for $c_n > 1$)
and that $(1+\gamma t)^{1/\gamma}$ is increasing in $t$. Because $c_n^{-\gamma}$ is a decreasing function of $\gamma$, the monotonicity
in $\chi_i$ follows from (2.2), (4.18) and condition (i) of Lemma 4.1 which imply

ai[n/k) 7j 

= ho((c„Xi)^»A; '/^u;„(7i)j 

> 



for sufficiently large n. 



The monotonicity of i/ "' (xj) implies that the sets EVI c\. are increasing in all parameters. 



Hence, for arbitrary 9,ip ^ [— Af , M] 

it'n(7i) Vw„(7: 



^n(^) - Znin < Z77ZX^7Z^7Zr^i^n{EiZi \ E^i^^) + Eu^iE^^^ \ eI%)) 



where $\theta \vee \psi$ resp. $\theta \wedge \psi$ denote the coordinatewise maximum resp. minimum of $\theta$ and $\psi$.
To establish asymptotic equicontinuity of $Z_n$, we cover the parameter space $[-M,M]^6$ with
hypercubes $I_l := \times_{i=1}^6 [l_i\delta,(l_i+1)\delta]$, $-\lceil M/\delta\rceil \le l_i \le \lfloor M/\delta\rfloor$, for some small $\delta > 0$ (depending on
the value $\eta$ in (4.20)) to be specified later on. For $\theta,\psi \in [-M,M]^6$ with $\|\theta - \psi\|_\infty \le \delta$ and
$l(\theta) := (\lfloor\theta_i/\delta\rfloor)_{1\le i\le 6}$, one has $\|l(\theta) - l(\psi)\|_\infty \le 1$ and thus

\Zn{9)-Znm 

< \Zn{9) - Zn{l{9)5)\ + \ZnW " ^„(/(V')5)| + |Z„(/(e)5) - Z„(/(V)<5)| 

< 3 max sup \Zn(t) — Zn(u)\ 

le{-\M/5-],...,[M/5i}6t,ueIt 
where $(l+1)\delta := ((l_i+1)\delta)_{1\le i\le 6}$. By (D1), the expectation can be approximated as follows:
EME^l^s \ eIs^) = jPlTriX, Y) G 4"),), \ E}f} = u{E^l^, \ E\f) + 0{A,{n/k)). (4.22) 

To bound the right-hand side, first note that by similar calculations as in the proof of Lemma HTTJ 
one obtains 

^ / r -A:-i/2iogc„(^ + op(l)) + Op(A;-i/2(3;d„/e„)-7.), 7, > x 

= ^xil+l A;-i/2(d„A:/n)-^»(te/7i+^./7f + op(l))x-^'+op(l)), 7* < 

""" V [ -k~^lHog^Cn{Xi + ^r/2 + Op{l))+Op{k-^lHogCn\ogx), 7, = / 

uniformly for x € [An,j,Tn,i]. That means that under the same conditions as in Lemma l4. II one 
can prove an analogous approximation where Fj is replaced with Xi if 7« > 0, ai/7i — /3i — ^i/jf is 
replaced with ^i/yi + t?j/7j^ if 7j < 0, and Lj is replaced with 2xi + ^i in the case 7j = 0. Hence, 
we may also conclude a corresponding analog to Corollary 14.51 i.e. z^((e„/(i„)£';, ^ . ) — i'(S) 
equals the right-hand side of (|4.17p with the above substitutions. Because all integrals are finite, 
there exists a constant K > such that for sufficiently large n 

U{E^I^^) - l^iElf) < ^K5k-'/\wn{jl)VWn{j2)) 

uniformly for all / G {—[A// (5] . . . [M/5J}^. A combination with (j4.22p . e^ x dn and condition 
(S2) shows that to each 77 > there exists 6 > such that for sufficiently large n 

EMe'^I^s \ E\f) < ^k-'/\wn{ji) V ^„(72)). (4.23) 

In view of (|4.2ip . we obtain 

P\ sup \Znie) - ZnWl > v} 

^ e,tp€[~M,M]<^,\\e-'ip\\^<5 -" 

> |fc"^/^(w'n(7l)VU'„(72))} 



/g{_|-M/51...LAf/5J}6 

^fc-V2 



/<5 J 



> ^fc-^/^(u;„(7i)Vu;„(72))}. 

Therefore the asserted asymptotic equicontinuity ()4.20p follows from ()4.23p and Chebyshev's in- 
equality applied to the binomial random variables kun{E^J^^.^ \ E\^ ): 

P{\ME^iii)S \ E[f) - EME^l^s \ Elf)\ > ^fc-^/^K(7i) V u;.(72))} 
kE.4E^l^,\Elf) 



< 



{v/Q)^H{Wn{7l)yWn{72W 
uniformly for all $l \in \{-\lceil M/\delta\rceil,\dots,\lfloor M/\delta\rfloor\}^6$.

It remains to prove that $Z_n(\theta) \to 0$ in probability for all $\theta \in [-M,M]^6$. This, however, follows

similarly by Chebyshev's inequality, (D1) and the aforementioned analog to Corollary 4.5:

P{\Znm>ri} 



< 



rfk{Wn{ll) y Wn{l2)Y 

v{Et'^) + 0{Ao{n/k)) 
0. 
Remark 4.7. Two remarks on this proof are in order. At first glance it seems peculiar that in the
definition of $H^{(\vartheta_i,\chi_i,\xi_i)}_{n,i}$ both parameters $-\vartheta_i$ and $\chi_i$ take over the role of $k^{1/2}(\hat\gamma_i - \gamma_i)$ in the definition
of $H_{n,i}$. This, however, is necessary to ensure the crucial monotonicity property of $H^{(\vartheta_i,\chi_i,\xi_i)}_{n,i}$ in the
case $\gamma_i > 0$.

Secondly, we used the (slightly old-fashioned) classical approach to establish asymptotic
equicontinuity instead of the often more elegant approach via bracketing numbers (see van der Vaart and
Wellner (2000), Theorem 2.11.9), because the same approximation error of order $O(A_0(n/k))$
in (D1) always enters the upper bound on $E\nu_n(E^{(n)}_{(l+1)\delta} \setminus E^{(n)}_{l\delta})$, thus impeding the calculation of
bracketing numbers for radii of smaller order.

Next we show that the terms I and III in decomposition (1.15) are negligible.
Lemma 4.8. If the conditions of Theorem 2.1 are fulfilled, then

1 /d, 



Pn - 



y(jlHn{S*S) = Op[d-^k-^'\wn{ll)yWn{l2))) (4.24) 

\en ' 

^{EMB)-l^{B))\B=(d„/e„)HAS^) = Op{d~'k-'/Hwn{7l)yWni72))). (4.25) 

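Proof. As $\hat p_n = \nu_n\big((d_n/e_n) H_n(S)\big)/e_n$, the left-hand side of (4.24) is non-negative with expectation

$$\frac{n}{k e_n} P\Big\{T_n(X,Y) \in \frac{d_n}{e_n} H_n(S \setminus S_n^*)\Big\}$$

$$\le \frac{n}{k e_n} P\Big\{T_n(X,Y) \in \frac{d_n}{e_n} H_n\Big((0,u_n^*) \times [q(u_n^*-),\infty) \,\cup\, [q^{\leftarrow}(v_n^*),\infty) \times [q(\infty),v_n^*)\Big)\Big\}$$

$$= \frac{1}{d_n}\Big(\nu\Big(H_n\Big((0,u_n^*) \times [q(u_n^*-),\infty) \,\cup\, [q^{\leftarrow}(v_n^*),\infty) \times [q(\infty),v_n^*)\Big)\Big) + o\Big(k^{-1/2}\big(w_n(\gamma_1) \vee w_n(\gamma_2)\big)\Big)\Big),$$

where we have used (D1) and (S2). Now assertion (4.24) follows from Lemma 4.2 and the proof of Corollary 4.5.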
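The passage from a bound on the expectation to a stochastic order for the non-negative left-hand side of (4.24) is an application of Markov's inequality. A minimal sketch, with $X_n$, $r_n$ and $M$ as placeholder symbols that do not appear in the original:

```latex
% X_n >= 0 is a random variable and r_n > 0 a rate (illustrative placeholders).
\[
  P\{X_n > M r_n\} \;\le\; \frac{E X_n}{M r_n} \qquad \text{for all } M > 0 .
\]
% Consequently, E X_n = O(r_n) implies X_n = O_P(r_n),
% and E X_n = o(r_n) implies X_n = o_P(r_n).
```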
Likewise, by conditions (D1), (S2) and $d_n \asymp e_n$, the left-hand side of (4.25) equals

□

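Finally, we derive a bound on term VI in decomposition (4.15).

Lemma 4.9. Under the assumptions of Theorem 2.1 one has

$$\nu(d_n S) - p_n = o\Big(d_n^{-1} k^{-1/2}\big(w_n(\gamma_1) \vee w_n(\gamma_2)\big)\Big).$$

Proof. With $\lambda_{n,i}, \tau_{n,i}$ as in Lemma 4.1 we define for $x \in [\lambda_{n,i}, \tau_{n,i}]$

$$H^*_{n,i}(x) := \Big(1 + \gamma_i\, \frac{U_i(d_n x) - b_i(d_n)}{a_i(d_n)}\Big)^{1/\gamma_i}.$$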



According to de Haan and Ferreira (2006), Theorems 2.3.6 and 2.3.7, one can choose $a_i(t)$ as a multiple of $t^{\gamma_i}$ and $b_i(t) = U_i(t) + O(a_i(t) A_i(t))$. Thus, with $\Delta_i(x)$ as defined in an earlier proof,

$$\frac{U_i(d_n x) - b_i(d_n)}{a_i(d_n)} = \frac{a_i(n/k)}{a_i(d_n)} \Big(\frac{U_i(d_n x) - b_i(n/k)}{a_i(n/k)} - \frac{b_i(d_n) - b_i(n/k)}{a_i(n/k)}\Big)$$

( n \ii f{xdnk/n)^* — 1 ^ , , idnk/n)'^^—! , , ,\ ^, . , , .. 
= (^) {^-^ + A,(x) + i^L^J + A,(1))+0(^.K)) 



7j v \ n / / \ \ n 

where in the last step we have used (4.1), (4.18) and the Potter bound for the regularly varying function $A_0$ (de Haan and Ferreira (2006), Prop. B.1.9 5.). We conclude that

Ui{dnX) - bi{dn) _^^,f, , ^f ^ t„iu\(^'^^^Y'^^\ , r^( A t ^ I u\ ( '^^^Y'^" ^-Ir 
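For reference, the Potter bound used in the preceding step can be recalled as follows for a function $f$ that is regularly varying with index $\rho$. The symbols $f$, $\rho$, $\delta_1$, $\delta_2$ and $t_0$ are generic placeholders; the statement is a sketch of the cited proposition, not a quotation of it.

```latex
% Potter bound in generic notation: f regularly varying with index \rho.
% For arbitrary \delta_1, \delta_2 > 0 there exists t_0 such that,
% for all t \ge t_0 and all x with tx \ge t_0,
\[
  (1-\delta_1)\, x^{\rho} \min\bigl(x^{\delta_2}, x^{-\delta_2}\bigr)
  \;<\; \frac{f(tx)}{f(t)} \;<\;
  (1+\delta_1)\, x^{\rho} \max\bigl(x^{\delta_2}, x^{-\delta_2}\bigr).
\]
```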



Check that the first remainder term is of smaller order than $k^{-1/2} w_n(\gamma_i)$ by condition (i) of Lemma 4.1. Moreover, for $\gamma_i > 0$, (4.19) and again condition (i) of Lemma 4.1 imply

A„(nA)(^)"%-- =0(A.(nA)(^)"''^fcV2/i,g,„) ^0, 

while for $\gamma_i < 0$ this convergence follows from the conditions (i) and (iii) of Lemma 4.1, and for $\gamma_i = 0$ it is obvious from condition (i).

This shows that $H^*_{n,i}(x)$ is indeed well defined with

o{k-^/^logCn) + 0{Ai{n/k){dnk/ny^+'x-^^), -/^ > \ 
H:^i{x) = x{l+{ o{k-y^{dnk/nr^^il + x-^^)), 7i<0 

0{k-y^ log^Cn), 7i = / 
uniformly for $x \in [\lambda_{n,i}, \tau_{n,i}]$. Notice that this representation is of the same type as the approximation derived in Lemma 4.1 with all leading terms equal to 0 (though in the case $\gamma_i > 0$ the second remainder term has a slightly different form). Therefore, we may proceed as before to conclude



u{H:{S:))-u{S) = o{k-'/\wn{7i)VWn{^2)))+Y.O{A{n/k){dnk/nr'+^)^^^^o} 
= o(fc-i/2(u;„(7i) Vu;n(72))), 



i=l 



where the last equality follows from Lemma 4.1 (i) (cf. Corollary 4.5).
To complete the proof, we must show that

$$p_n - \nu\big(d_n H_n^*(S_n^*)\big) = o\Big(d_n^{-1} k^{-1/2}\big(w_n(\gamma_1) \vee w_n(\gamma_2)\big)\Big).$$

This, however, follows from assumption (D1) (with $t = d_n$) in a similar way as (4.24). □
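The two bounds of this proof combine to the assertion of Lemma 4.9 through the homogeneity of the exponent measure. The following sketch assumes that $\nu(tB) = t^{-1}\nu(B)$ for $t > 0$, as is usual for standardized margins but not restated in this section, and it reads the set in the last display as $d_n H_n^*(S_n^*)$.

```latex
% Assumption: \nu is homogeneous of order -1, i.e. \nu(tB) = t^{-1}\nu(B) for t > 0.
\[
  \nu(d_n S) - p_n
  \;=\; d_n^{-1}\bigl(\nu(S) - \nu(H_n^*(S_n^*))\bigr)
  \;+\; \bigl(\nu(d_n H_n^*(S_n^*)) - p_n\bigr).
\]
% Each term on the right-hand side is
% o\bigl(d_n^{-1} k^{-1/2} (w_n(\gamma_1) \vee w_n(\gamma_2))\bigr).
```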

Proof of Theorem 2.1. The assertion is a direct consequence of (4.15), Corollary 4.5 and of Lemmas 4.6, 4.8 and 4.9. □



Proof of Corollary 2.2. First note that, similarly as for $\hat p_n$, one obtains the representation
MSt^) = ^n{^H+{S)) with H+{x,y) := {Hr.,,{x), H+,{y)), 

K2iy) ■= T^:r o f^ o {flf^)r o C/(d„y) 

and c"*" := c+ := (1 + £n)n/{ken)- Thus Lemma HT] (with Cn replaced by 6^/(1 + in)) yields the 
approximation 

/ r -fc-V2iogc„(r2/72 + op(l)) + Op(A;-V2(yd„/e„)-^2)^ ^2 > 

H+2iy) = (1+4)2/ l+< A:-i/2(d„fc/n)-^2((«2/72 - 1^2 -r2hl + op{l))y~^^ + op{l)), 72 < 
V ( -fc-i/2iog2c„(r2/2 + op(l)) + Op(A:-i/2 log c„ logy), 72 = 

Now the very same arguments as used in the analysis of $\hat p_n$ show that

$$\hat\nu_n(S^+_{n,2}) = \nu(S^+_{n,2}) + O_P\Big(k^{-1/2}\big(w_n(\gamma_1) \vee w_n(\gamma_2)\big)\Big).$$

Together with an analogous approximation for $\hat\nu_n(S^-_{n,2})$ and our assumption on $\epsilon_n$, we may conclude that

dn f dn ^{Sn,2) " ^i^^^) , .. ^ 

— In,2 = ^^ + Op{l) 

{2£n)^ / r]{u,v) dv du 

00 

q{u)r]{u,q{u)) du. 

In the last step we have used the fact that, on the range of integration, $\eta(u, v)$ is continuous and
bounded by a multiple of u~^ V {q{u))~'^ (cf. ()2.4p ). so that the integrand of the outer integral can 
easily be bounded by an integrable function and convergence follows by the dominated convergence theorem. □